This project is a collection of scrapers that scrape real estate data from different websites. The data is then stored in a database and used for notifying users about new listings based on their preferences.
- sreality.cz
- bezrealitky.cz
- reality.idnes.cz
- Svoboda-Williams.com
- realitymix.cz
- archer-reality.cz
- reality.cz
- eurobydleni.cz
- https://www.maxima.cz/nabidka-nemovitosti/?at=1&kraj=praha
- Remax-czech.cz
- www.mmreality.cz
- https://www.engelvoelkers.com/
All the pipelines are run in the same docker container, to build the image and publish it to Docker Hub run the following commands:
docker buildx build --platform linux/amd64 -t bakoad/real-estate-pipelines:latest .
docker push bakoad/real-estate-pipelines:latest
To generate a new scraper run the following command in from the data_infrastructure directory:
python automations/generate_new_scraper.py
The project is a collection of scrapers that scrape real estate data from different websites. Along side the scrappers there is a website where users can set their preferences for real estate listings. The data is then stored in a database and used for notifying users about new listings based on their preferences.