Pipeline to convert long-read sequencing data to a representative transcriptome for a cancer type and perform primary sequence characterization of the novel ORFs.

siegmundwang/isoseq2orf

isoseq2orf

Pipeline to convert long-read sequencing data to a representative transcriptome for a cancer type and perform primary sequence characterization of the novel ORFs. The flowchart of the pipeline is shown here:

[Figure: flowchart of the pipeline]

000_ccs2gtf.sh: This script converts raw data from the PacBio sequencer into the master transcriptome.

001_gtf2orf.sh: This script performs QC, predicts the ORFs, and performs primary sequence characterization of the master transcriptome.

002_gtf2qnt.sh: This script quantifies the master transcriptome using external short-read RNA-seq datasets.

003_orf2ms.sh: This script performs MS/MS validation of the predicted novel ORFs.
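
The numeric prefixes suggest that the four scripts are meant to be run in order, with later stages consuming the output of earlier ones. The following is only a minimal sketch of a driver under that assumption. The `run_stages` function and the `STAGES` array are hypothetical helpers, not part of the repository. The sketch also assumes each script can be invoked without arguments, which the README does not state; the scripts may expect input paths to be edited or passed in, so check each one before running it.

```shell
#!/usr/bin/env bash
# Hypothetical driver (not part of the repository): runs the four stage
# scripts in numeric order and stops at the first failure.
# Assumption: each script can be invoked with no arguments. Inspect the
# scripts for the inputs and paths they actually expect.

STAGES=(000_ccs2gtf.sh 001_gtf2orf.sh 002_gtf2qnt.sh 003_orf2ms.sh)

run_stages() {
    local dir="${1:-.}"   # directory holding the stage scripts
    local stage
    for stage in "${STAGES[@]}"; do
        if [ ! -f "$dir/$stage" ]; then
            echo "missing stage script: $dir/$stage" >&2
            return 1
        fi
        echo "running $stage"
        bash "$dir/$stage" || { echo "stage failed: $stage" >&2; return 1; }
    done
}
```

With the functions loaded, `run_stages /path/to/isoseq2orf` would run every stage from a local clone. Stopping at the first failure keeps a broken early stage from feeding incomplete files into ORF prediction or MS/MS validation.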
