Skip to content

This project build spark and jupyterlab for easy setup Dev environment ⭐

Notifications You must be signed in to change notification settings

dnguyenngoc/lab-spark

Repository files navigation

lab-spark

This project sharing Notebook on AI, Bigdata, Realtime analytic, Monitoring ... Instructions for building a lab test environment with multiple components. Include but not limit: haddoop, spark, kafka.

Structure

├── jupyterlab -> contains config jupyterlab
├── share_storages
    ├── lab -> All notebook
    ├── data -> public data like image, ai-model, csv ..
        ├── dataset
            ├── dogs-vs-cats
            ...
        ├── image
        ├── model
├── docker-compose.yaml -> run server
...

Posts and Notebooks

🔥 Note: 🔥 Corresponding to each article is a Notebook of the same name in the path share_storages/lab

Start Lab

1. Clone repo

git clone https://github.com/dnguyenngoc/lab-spark.git \
    && cd lab-spark 

2. Build with docker

docker-compose -f <docker-compose file .yaml> up
Service URL user/pass
Jupyterlab http://localhost:8888 1q2w3e4r

Test

Contact Us

About

This project build spark and jupyterlab for easy setup Dev environment ⭐

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published