Skip to content

R package for SiRCle multiomics integration

License

GPL-3.0, MIT licenses found

Licenses found

GPL-3.0
LICENSE
MIT
LICENSE.md
Notifications You must be signed in to change notification settings

ArianeMora/SiRCleR

Repository files navigation

Lifecycle: maturing GitHub issues

Short Introduction

The Signature Regulatory Clustering (SiRCle) method has first been released as a biorxiv preprint and since then we have enhanced the method further as part of our revision in genome medicine.
The SiRCle method was first developed in 2022 to integrate DNA methylation, RNA-seq and proteomics data at the gene level to deconvolute the association between dysregulation within and across possible regulatory layers (DNA methylation, transcription and/or translation). The results of the study were published on biorxiv in a shared effort by Mora & Schmidt et. al.. As part of the revisions in Genome Medicine, we have updated some parts of the SiRCle clustering method and extended our methods section in the manuscript. We also added new data and a PanCan analysis. All of this can be found in Mora&Schmidt et al. [@Mora_Schmidt2024].  You can find the detailed code for both manuscripts on our website or in our SiRCleManuscript GitHub repository.

The SiRCle method integrates DNA methylation, RNA-seq and proteomics data at the gene level to deconvolute the association between dysregulation within and across possible regulatory layers (DNA methylation, transcription and/or translation).Based on logical regulatory rules, sircleRCM, genes are grouped in SiRCle clusters based on the layer (DNA methylation, transcription and/or translation) where dys-regulation first occurs. Using the output of sircleRCM, the SiRCle clusters, one can find the primary biological processes altered by applying Over Representation Analysis (ORA) (sircleORA) and the drivers behind it using Transcription Factor (TF) analysis (sircleTF). Lastly, to compare patient’s subsets (e.g. based on stage), we found that integrating across the data layers prior to performing differential analysis and biological enrichment better captures the biological signal. Hence we use a variational autoencoder (VAE) to learn gene-wise relationships across the three data layers to obtain an integrated value for each gene (sircleVAE). Unsing the integarted value we next perform a Mann-Whitney U test to identify genes with a significant integrated difference between the patient’s groups.

SiRCle uses logical regulatory rules, sircleRCM to group genes in SiRCle clusters based on the data layer (DNA methylation, transcription and/or translation) where dys-regulation first occurs.
Using the output of sircleRCM, the SiRCle clusters, one can find the primary biological processes altered by applying Over Representation Analysis (ORA) and the drivers behind it using Transcription Factor (TF) analysis.
Lastly, to compare patient’s subsets (e.g. based on stage), we found that integrating across the data layers prior to performing differential analysis and biological enrichment better captures the biological signal. Hence we use a variational autoencoder (VAE) to learn gene-wise relationships across the three data layers to obtain an integrated value for each gene (sircleVAE). Using the integrated value we next perform a Mann-Whitney U test to identify genes with a significant integrated difference between the patient’s groups.

Tutorials

We have generated several vignettes showcasing the the usage of SiRCleR using the publicly available datasets as in the manuscript Mora&Schmidt et al., which are included as example data within SiRCleR. You can find those tutorial on the top under the “Tutorials” button, where you can follow a specific user case example and learn more about how to choose the settings including the Background Method, input data thresholds and Regulation Grouping. You can also follow the links below:
* ChooseSettings
* DataAnalysisExample

If you are seeking to also apply the variational autoencoder on the SiRCle clustering output, please visit the python package or use reticulate in R. You can find the python package on the Python SiRCle.

Install

SiRCleR is an R package. 1. Install Rtools if you haven’t done this yet, using the appropriate version (e.g.windows or macOS). 2. Install the latest development version from GitHub with: SiRCleR package direcly in R: devtools::install_github("https://github.com/ArianeMora/SiRCleR") library(SiRCleR)

Dependencies

While we have done our best to ensure all the dependencies are documented, if they aren’t please let us know and we will try to resolve them.

Windows specifications

Note if you are running Windows you might have an issue with long paths, which you can resolve in the registry on Windows 10:
Computer Configuration > Administrative Templates > System > Filesystem > Enable Win32 long paths
(If you have a different version of Windows, just google “Long paths fix” and your Windows version)

Questions & Issues

Please post questions and issues related to SiRCleR functions on the Issues section of this GitHub repository.

Reproducibility

If you want to reproduce the results of of our publication, please use the python package version found here: https://doi.org/10.1101/2022.07.02.498058

Citation

If you use this please cite our manuscript Mora&Schmidt et al.. Ariane Mora & Christina Schmidt, Brad Balderson, Christian Frezza & Mikael Bodén. 2024. “SiRCle (Signature Regulatory Clustering) Model Integration Reveals Mechanisms of Phenotype Regulation in Renal Cancer.” Genome Medicine 16 (1): 144. https://doi.org/10.1186/s13073-024-01415-3.

About

R package for SiRCle multiomics integration

Resources

License

GPL-3.0, MIT licenses found

Licenses found

GPL-3.0
LICENSE
MIT
LICENSE.md

Stars

Watchers

Forks

Packages

No packages published

Languages