A Cookiecutter template for creating a CTData frictionless and tidy dataset.
- GitHub repo: https://github.com/CT-Data-Collaborative/ctdata-dataset-cookiecutter
- MIT License
Install cookiecutter if you haven't already:
pip install -U cookiecutter
Create a new dataset:
cookiecutter https://github.com/CT-Data-Collaborative/ctdata-dataset-cookiecutter
Then:
- Create a repo and put the generated project in it.
- Install the dev requirements into a virtualenv (pip install -r requirements.txt).
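The two steps above can be sketched as a small shell helper. This is only one way to do it, under several assumptions: the function name `setup_dataset_repo`, the `.venv` location, and the `my-dataset` directory name are hypothetical; `git` and `python3` are assumed to be on your PATH; and `requirements.txt` is assumed to sit at the root of the generated project.

```shell
# Sketch: put a generated project under version control and install its
# dev requirements into a virtualenv. Names here are illustrative only.
setup_dataset_repo() (
    # The body runs in a subshell, so the caller's working directory is unchanged.
    project_dir="$1"                  # hypothetical: the directory cookiecutter generated
    cd "$project_dir" &&              # stop here if the directory does not exist
    git init &&                       # create the repo in place
    python3 -m venv .venv &&          # virtualenv location is an assumption
    .venv/bin/pip install -r requirements.txt
)
```

Usage, assuming the generated project lives in `my-dataset`: `setup_dataset_repo my-dataset`. The `.venv/bin/pip` path applies to Unix-like systems; on Windows the virtualenv layout differs.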
Now you are set up to start processing data. The resulting Tabular Data Package includes a README with additional information on how to specify metadata, run tests, and so on.