Skip to content

akram-mohammed/candis

 
 

Repository files navigation


A data mining suite for DNA microarrays

candis is an open source data mining suite (released under the GNU General Public License v3) for DNA microarrays that consists of a wide collection of tools you require, right from Data Extraction to Model Deployment. candis is built on top of the toolkit - CancerDiscover written by the bioinformaticians at HelikarLab.

WARNING: candis currently is still in dev mode and not production-ready yet. In case if you run across bugs or errors, raise an issue over here.

Table of Contents

Installation

To install candis right from scratch, check out our exhaustive guides:

Usage

Launching the RIA (Rich Internet Application)

via CLI

$ candis

OR

$ python -m candis

via Python

>>> import candis
>>> candis.main()

Using the CLI (Command Line Interface)

$ candis --cdata path/to/data.cdata --config path/to/config.json

Features

  • Converting a CDATA to an ARFF file

    >>> import candis
    >>> cdata = candis.cdata.read('path/to/data.cdata')

    Then, simply use the CData.toARFF API:

    >>> cdata.toARFF('path/to/data.arff')
  • Running a Pipeline.

    >>> pipe = candis.Pipeline()
    >>> pipe.run(cdata)
    >>> while pipe.status == candis.Pipeline.RUNNING:
    ...     # do something while pipeline is running

Dependencies

  • Production Dependencies
    • R
    • WEKA (NOTE: Requires Java)
    • Python 3.6+ and PIP (Python's Package Manager)
  • Development Dependencies

License

This software has been released under the GNU General Public License v3.

About

🎀 A data mining suite for DNA microarrays.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • JavaScript 59.0%
  • Python 31.1%
  • Jupyter Notebook 4.2%
  • HTML 1.7%
  • CSS 1.7%
  • R 1.6%
  • Other 0.7%