Skip to content

coleygroup/ml-for-tb

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ml-for-tb

Project for sharing surrogate models for TB inhibitors, data sharing, and Jupyter notebooks for prototyping.

Data

the data directory can be used for small (a few MB) CSV files for easy access. Larger files can be shared with figshare, gdrive, etc. and downloaded on your local machine or cluster.

TODO: raw activity data

docking validation data: PDB structures used for redocking, config files, images of docked poses, docking output

Links to data:

  • TBD

Env setup

As the project develops, environment.yml and requirements.txt can be used to keep python dependencies organized. This could turn into scripts and an installable python package later on.

Basic, flexible folder structure

├── data
│   └── 0_raw            <- The original data. Good for small (a few MB) CSV files and data for debugging. 
│
├── docs                
│   └── data_dictionaries            <- Documentation that briefly explains data. 
│
├── models             <- Trained and serialized models, model predictions, or model summaries
│
├── notebooks          <- Jupyter notebooks. Naming convention is a number (for ordering),
│                         the creator's initials, and a short `-` delimited description, e.g.
│                         `1.0-ncf-initial-data-exploration`.

About

surrogate models for TB inhibitors

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published