A repository of data from multiple studies, developed by the Department of Biostatistics and Bioinformatics (DBB) at George Washington University.
- Citation: A reference data library for clinical trials and omics data, Department of Biostatistics and Bioinformatics, Milken Institute School of Public Health, George Washington University, 2020
This repository includes a library of datasets from studies performed by DBB faculty members. Each dataset has:
- a brief description
- links and citations to related publications, and
- a wiki page with instructions (R and Python) for loading and analyzing data from the library.
The datasets cover a range of human diseases and environmental studies, and include omics and clinical trial data types.
The data library is implemented as an R package.
Instructions for installing it using RStudio:
If you do not have RStudio installed on your computer, you can find installation information here
In the RStudio console, run the following R commands to install the requirements and the library:
- install and load devtools, a tool that enables installing packages from GitHub
install.packages("devtools")
library(devtools)
- install GWDBB from the GitHub repository
install_github('GWCBI/GWDBB')
- load the library
library(GWDBB)
- load a dataset of interest, e.g. microbial species from the Human Microbiome Project
data("HMP1_II_Metadata")
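The steps above can be combined into a single script. The following is a minimal sketch, not part of the package: `load_dataset` is a hypothetical helper introduced here for illustration, and the sketch assumes that `GWCBI/GWDBB` installs cleanly and ships a dataset named `HMP1_II_Metadata`, as the commands above suggest. It loads the dataset into a private environment so that the names of the objects it defines can be inspected rather than guessed.

```r
# Hypothetical helper (not part of GWDBB): load one dataset from a package
# into a private environment and return the objects it defines as a named list.
load_dataset <- function(name, package = "GWDBB") {
  env <- new.env()
  data(list = name, package = package, envir = env)
  mget(ls(env), envir = env)
}

if (interactive()) {
  # Install devtools and GWDBB only when they are missing.
  if (!requireNamespace("devtools", quietly = TRUE)) {
    install.packages("devtools")
  }
  if (!requireNamespace("GWDBB", quietly = TRUE)) {
    devtools::install_github("GWCBI/GWDBB")
  }

  # List the datasets the installed package provides (name and title).
  print(data(package = "GWDBB")$results[, c("Item", "Title")])

  # Load one dataset and summarize the structure of what it contains.
  objs <- load_dataset("HMP1_II_Metadata")
  str(objs, max.level = 1)
}
```

Checking for the packages with `requireNamespace` before installing avoids reinstalling them every time the script is run.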
Available datasets:
- HMP1-II: a dataset of microbial species abundance and related metadata
- Breast milk and infant stool omics
- Treatment Options for Type II Diabetes in Adolescents and Youth (TODAY) study
- Systemic Lupus Erythematosus: Gene Expression and Expression of Human Endogenous Retroviruses
- COVID-19 Community Research Partnership
Please report any problems you encounter when using the data library through Issues.