🌊 Git-like Version Control for Data with Nessie, Iceberg, and Spark
distributed-systems apache-spark etl s3 data-engineering minio dataops block-storage time-travel data-pipelines data-versioning etl-pipeline spark-etl apache-iceberg git-for-data data-lakehouse apache-nessie atomic-etl table-format branch-based-development
-
Updated
Jan 21, 2025 - Jupyter Notebook