MONITORING OF APARTMENT PRICES IN THE CZECH REPUBLIC THROUGH PARSING A WEB ADVERTISING SERVER

Time series of apartment prices in the Czech Republic are available only in partial statistics published by the Statistical Office. Apartment prices are otherwise reported mainly in articles and commentary by real estate agents. Data unavailability leads to a small number of statistically oriented public...


Bibliographic Details
Main Authors: Alena POZDÍLKOVÁ, Jaroslav MAREK, Marie NEDVĚDOVÁ
Format: Article
Language: English
Published: Technical University of Kosice 2020-05-01
Series: Acta Electrotechnica et Informatica
Subjects: web page parsing, real estate market, time series, apartment prices, floor area, purchased price
Online Access: http://www.aei.tuke.sk/papers/2020/1/2_Pozdilkova.pdf
id doaj-1e283a81b85b476bac48153668dbdece
record_format Article
spelling doaj-1e283a81b85b476bac48153668dbdece
2020-11-25T02:57:27Z
eng
Technical University of Kosice
Acta Electrotechnica et Informatica
1335-8243
1338-3957
2020-05-01
20
1
9
14
10.15546/aeei-2020-0002
MONITORING OF APARTMENT PRICES IN THE CZECH REPUBLIC THROUGH PARSING A WEB ADVERTISING SERVER
Alena POZDÍLKOVÁ 0
Jaroslav MAREK 1
Marie NEDVĚDOVÁ 2
Department of Mathematics and Physics, Faculty of Electrical Engineering and Informatics, University of Pardubice, Studentská 95, 532 10 Pardubice, Czech Republic
Department of Mathematics and Physics, Faculty of Electrical Engineering and Informatics, University of Pardubice, Studentská 95, 532 10 Pardubice, Czech Republic
Department of Mathematics and Physics, Faculty of Electrical Engineering and Informatics, University of Pardubice, Studentská 95, 532 10 Pardubice, Czech Republic
Time series of apartment prices in the Czech Republic are available only in partial statistics published by the Statistical Office. Apartment prices are otherwise reported mainly in articles and commentary by real estate agents. This lack of data means that few statistically oriented publications on the real estate market exist. The main aim of our paper is thus to introduce a software solution for parsing real estate websites. We can retrieve only the asking prices stated in advertisements; actual sale prices are not available. By automatic polling, we obtain the floor area of each advertised apartment and its asking price. A Python script was written to retrieve data from sreality.cz. A MongoDB database is used to store the ads, and new ads are saved directly to it. The daily average price per square meter is then calculated for each municipality. The filtered data can be displayed or exported to a file via the web interface. In the statistical analyses, we present graphs showing the development of apartment prices and the number of advertisements in various municipalities of the Czech Republic from 09/2018 to 12/2019. Next, we address the clustering of municipalities according to the similarity of their relative price changes.
http://www.aei.tuke.sk/papers/2020/1/2_Pozdilkova.pdf
web page parsing
real estate market
time series
apartment prices
floor area
purchased price
collection DOAJ
language English
format Article
sources DOAJ
author Alena POZDÍLKOVÁ
Jaroslav MAREK
Marie NEDVĚDOVÁ
title MONITORING OF APARTMENT PRICES IN THE CZECH REPUBLIC THROUGH PARSING A WEB ADVERTISING SERVER
publisher Technical University of Kosice
series Acta Electrotechnica et Informatica
issn 1335-8243
1338-3957
publishDate 2020-05-01
description Time series of apartment prices in the Czech Republic are available only in partial statistics published by the Statistical Office. Apartment prices are otherwise reported mainly in articles and commentary by real estate agents. This lack of data means that few statistically oriented publications on the real estate market exist. The main aim of our paper is thus to introduce a software solution for parsing real estate websites. We can retrieve only the asking prices stated in advertisements; actual sale prices are not available. By automatic polling, we obtain the floor area of each advertised apartment and its asking price. A Python script was written to retrieve data from sreality.cz. A MongoDB database is used to store the ads, and new ads are saved directly to it. The daily average price per square meter is then calculated for each municipality. The filtered data can be displayed or exported to a file via the web interface. In the statistical analyses, we present graphs showing the development of apartment prices and the number of advertisements in various municipalities of the Czech Republic from 09/2018 to 12/2019. Next, we address the clustering of municipalities according to the similarity of their relative price changes.
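The aggregation step named in the description (a daily average price per square meter for each municipality) and the relative price changes used for comparing municipalities could look roughly like the sketch below. It is an illustration only, not the authors' script: the record field names (`date`, `municipality`, `price_czk`, `area_m2`), the sample values, and both function names are hypothetical, and averaging per-ad ratios (rather than dividing total price by total area) is an assumption.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical ad records; the field names and values are assumptions,
# not the schema or data used in the paper.
ads = [
    {"date": "2019-01-01", "municipality": "Pardubice", "price_czk": 3_000_000, "area_m2": 60},
    {"date": "2019-01-01", "municipality": "Pardubice", "price_czk": 2_000_000, "area_m2": 50},
    {"date": "2019-01-01", "municipality": "Praha", "price_czk": 8_000_000, "area_m2": 80},
    {"date": "2019-01-02", "municipality": "Pardubice", "price_czk": 3_300_000, "area_m2": 60},
]


def daily_avg_price_per_m2(records):
    """Return {(date, municipality): mean asking price per square meter}."""
    groups = defaultdict(list)
    for ad in records:
        if ad["area_m2"] > 0:  # skip ads with a zero or invalid floor area
            key = (ad["date"], ad["municipality"])
            groups[key].append(ad["price_czk"] / ad["area_m2"])
    return {key: mean(values) for key, values in groups.items()}


def relative_changes(series):
    """Relative change between consecutive values of a price series."""
    return [(b - a) / a for a, b in zip(series, series[1:])]


averages = daily_avg_price_per_m2(ads)

# Build the Pardubice series in date order (ISO dates sort chronologically)
# and compute its day-to-day relative changes.
pardubice = [value for (day, town), value in sorted(averages.items()) if town == "Pardubice"]
changes = relative_changes(pardubice)
```

Vectors of relative changes like `changes`, one per municipality, are the kind of input a clustering method could compare; the description does not say which method the paper uses, so none is shown here.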
topic web page parsing
real estate market
time series
apartment prices
floor area
purchased price
url http://www.aei.tuke.sk/papers/2020/1/2_Pozdilkova.pdf