-
Notifications
You must be signed in to change notification settings - Fork 1
Workflow for data harmonization
Data harmonization is the process of transforming data from the original collection of machine readable information to a standardize format. This original data collection may be composed of a machine-readable set of relational data tables and/or graphical (list-based) data including both primary data and metadata objects. The transformation is done via a set of scripts and manually created annotation file. This final format is a single data table with the following columns: id(s) - of variable - is type - with entry. Provenance in this stage is maintained by a read script, an annotation file documented any manual data extractions or assignments, and documentation of who did this work.
*This page is a work in progress transitioning from Workflow for new data additions
-
Workflow for data rescue
- Understand data source
- Transcribe data
- Review transcription
-
Workflow for new data additions
- Find data
- Open ticket
- Evaluation
- Annotations
- Read scripts
- Integration
- QA/QC
- Merge to main
- Publish
- Data collections