This project shares notebooks on AI, big data, real-time analytics, monitoring, and related topics, along with instructions for building a lab test environment with multiple components, including but not limited to Hadoop, Spark, and Kafka.
├── jupyterlab -> JupyterLab configuration
├── share_storages
│   ├── lab -> all notebooks
│   ├── data -> public data such as images, AI models, CSV files ...
│   │   ├── dataset
│   │   │   ├── dogs-vs-cats
│   │   │   ...
│   │   ├── image
│   │   ├── model
├── docker-compose.yaml -> runs the server
...
🔥 Note: 🔥 Each article has a corresponding notebook of the same name under share_storages/lab
1. Spark Distributed ML model with Pandas UDFs ---> Notebook (2022/03/22)
2. Cats vs Dogs Classification using CNN Keras ---> Notebook (2022/03/28)
git clone https://github.com/dnguyenngoc/lab-spark.git \
&& cd lab-spark
docker-compose -f <docker-compose file .yaml> up
| Service | URL | user/pass |
|---|---|---|
| JupyterLab | http://localhost:8888 | 1q2w3e4r |
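
The containers can take a while to start, so it may help to check that JupyterLab is answering before opening the browser. The following is a minimal sketch and not part of the repository: `wait_for_service` is a hypothetical helper name, and the timeout and interval defaults are assumptions. It polls a URL with the Python standard library until the server returns any HTTP response.

```python
import time
import urllib.error
import urllib.request

# Hypothetical helper (not part of this repository).
# Proxies are disabled on the assumption that the service runs on the local machine.
_opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))


def wait_for_service(url: str, timeout: float = 60.0, interval: float = 2.0) -> bool:
    """Poll `url` until the server answers or `timeout` seconds have passed."""
    deadline = time.monotonic() + timeout
    while True:
        try:
            with _opener.open(url, timeout=5):
                return True
        except urllib.error.HTTPError:
            # The server replied with an error status (e.g. 403), so it is up.
            return True
        except (urllib.error.URLError, OSError):
            # Connection refused or timed out: the service is not ready yet.
            pass
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)
```

For example, `wait_for_service("http://localhost:8888")` should return `True` once the JupyterLab container accepts connections, and `False` if nothing responds within the timeout.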
- Email-1: duynnguyenngoc@hotmail.com - Duy Nguyen ❤️ ❤️ ❤️