Skip to content

bakobako/bytovy-lovec

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Prague Real Estate Data Scraping project

This project is a collection of scrapers that scrape real estate data from different websites. The data is then stored in a database and used for notifying users about new listings based on their preferences.

Data sources that are scraped

  • sreality.cz
  • bezrealitky.cz
  • reality.idnes.cz

Data sources that are planned to be scraped

Building the image for Prefect pipelines

All the pipelines are run in the same docker container, to build the image and publish it to Docker Hub run the following commands:

docker buildx build --platform linux/amd64 -t bakoad/real-estate-pipelines:latest .
docker push bakoad/real-estate-pipelines:latest

Automations

To generate a new scraper run the following command in from the data_infrastructure directory:

python automations/generate_new_scraper.py

For chatgpt definiton of the project

The project is a collection of scrapers that scrape real estate data from different websites. Along side the scrappers there is a website where users can set their preferences for real estate listings. The data is then stored in a database and used for notifying users about new listings based on their preferences.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published