A demo project on batch data-parallel processing using Apache Beam and Python
python count pipeline transformations kaggle batch apache-beam google-colab colab-notebook pcollections beam-python beam-sdk groupby-transformation
-
Updated
Mar 26, 2021 - Jupyter Notebook