NCLscan: accurate identification of non-co-linear transcripts (fusion, trans-splicing and circular RNA) with a good balance between sensitivity and precision

Cited by: 91
Authors
Chuang, Trees-Juen [1]
Wu, Chan-Shuo [1]
Chen, Chia-Ying [1]
Hung, Li-Yuan [1]
Chiang, Tai-Wei [1]
Yang, Min-Yu [1]
Affiliations
[1] Acad Sinica, Genom Res Ctr, Div Phys & Computat Genom, Taipei 11529, Taiwan
Keywords
BCR-ABL FUSION; PAIRED-END; GENE FUSIONS; CHIMERIC TRANSCRIPTS; RECURRENT FUSION; READ ALIGNMENT; BREAST-CANCER; SEQ DATA; ALGORITHM; DISCOVERY;
DOI
10.1093/nar/gkv1013
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Analysis of RNA-seq data often detects numerous 'non-co-linear' (NCL) transcripts, which comprise sequence segments that are topologically inconsistent with their corresponding DNA sequences in the reference genome. However, detection of NCL transcripts involves two major challenges: removal of false positives arising from alignment artifacts and discrimination between different types of NCL transcripts (trans-spliced, circular or fusion transcripts). Here, we developed a new NCL-transcript-detecting method ('NCLscan'), which utilized a stepwise alignment strategy to almost completely eliminate false calls (>98% precision) without sacrificing true positives, enabling NCLscan to outperform 18 other publicly available tools (including fusion- and circular-RNA-detecting tools) in terms of sensitivity and precision, regardless of the generation strategy of the simulated dataset, type of intragenic or intergenic NCL event, read depth of coverage, read length or expression level of the NCL transcript. With this high accuracy, NCLscan was applied to distinguish between trans-spliced, circular and fusion transcripts on the basis of poly(A)- and non-poly(A)-selected RNA-seq data. We showed that circular RNAs were expressed more ubiquitously, more abundantly and less cell type-specifically than trans-spliced and fusion transcripts. Our study thus describes a robust pipeline for the discovery of NCL transcripts, and sheds light on the fundamental biology of these noncanonical RNA events in the human transcriptome.
Pages: 16
Related papers
98 entries in total
[1]   Bellerophontes: an RNA-Seq data analysis framework for chimeric transcripts discovery based on accurate fusion model [J].
Abate, Francesco ;
Acquaviva, Andrea ;
Paciello, Giulia ;
Foti, Carmelo ;
Ficarra, Elisa ;
Ferrarini, Alberto ;
Delledonne, Massimo ;
Iacobucci, Ilaria ;
Soverini, Simona ;
Martinelli, Giovanni ;
Macii, Enrico .
BIOINFORMATICS, 2012, 28 (16) :2114-2121
[2]   Transcription-mediated gene fusion in the human genome [J].
Akiva, P ;
Toporik, A ;
Edelheit, S ;
Peretz, Y ;
Diber, A ;
Shemesh, R ;
Novik, A ;
Sorek, R .
GENOME RESEARCH, 2006, 16 (01) :30-36
[3]   Post-transcriptional exon shuffling events in humans can be evolutionarily conserved and abundant [J].
Al-Balool, Haya H. ;
Weber, David ;
Liu, Yilei ;
Wade, Mark ;
Guleria, Kamlesh ;
Pitsien Lang Ping Nam ;
Clayton, Jake ;
Rowe, William ;
Coxhead, Jonathan ;
Irving, Julie ;
Elliott, David J. ;
Hall, Andrew G. ;
Santibanez-Koref, Mauro ;
Jackson, Michael S. .
GENOME RESEARCH, 2011, 21 (11) :1788-1799
[4]   Correlation of circular RNA abundance with proliferation - exemplified with colorectal and ovarian cancer, idiopathic lung fibrosis, and normal human tissues [J].
Bachmayr-Heyda, Anna ;
Reiner, Agnes T. ;
Auer, Katharina ;
Sukhbaatar, Nyamdelger ;
Aust, Stefanie ;
Bachleitner-Hofmann, Thomas ;
Mesteri, Ildiko ;
Grunt, Thomas W. ;
Zeillinger, Robert ;
Pils, Dietmar .
SCIENTIFIC REPORTS, 2015, 5 :8057
[5]   The Landscape of MicroRNA, Piwi-Interacting RNA, and Circular RNA in Human Saliva [J].
Bahn, Jae Hoon ;
Zhang, Qing ;
Li, Feng ;
Chan, Tak-Ming ;
Lin, Xianzhi ;
Kim, Yong ;
Wong, David T. W. ;
Xiao, Xinshu .
CLINICAL CHEMISTRY, 2015, 61 (01) :221-230
[6]   Cloning of BCAS3 (17q23) and BCAS4 (20q13) genes that undergo amplification, overexpression, and fusion in breast cancer [J].
Bärlund, M ;
Monni, O ;
Weaver, JD ;
Kauraniemi, P ;
Sauter, G ;
Heiskanen, M ;
Kallioniemi, OP ;
Kallioniemi, A .
GENES CHROMOSOMES & CANCER, 2002, 35 (04) :311-317
[7]   Genomic sequencing of colorectal adenocarcinomas identifies a recurrent VTI1A-TCF7L2 fusion [J].
Bass, Adam J. ;
Lawrence, Michael S. ;
Brace, Lear E. ;
Ramos, Alex H. ;
Drier, Yotam ;
Cibulskis, Kristian ;
Sougnez, Carrie ;
Voet, Douglas ;
Saksena, Gordon ;
Sivachenko, Andrey ;
Jing, Rui ;
Parkin, Melissa ;
Pugh, Trevor ;
Verhaak, Roel G. ;
Stransky, Nicolas ;
Boutin, Adam T. ;
Barretina, Jordi ;
Solit, David B. ;
Vakiani, Evi ;
Shao, Wenlin ;
Mishina, Yuji ;
Warmuth, Markus ;
Jimenez, Jose ;
Chiang, Derek Y. ;
Signoretti, Sabina ;
Kaelin, William G., Jr. ;
Spardy, Nicole ;
Hahn, William C. ;
Hoshida, Yujin ;
Ogino, Shuji ;
DePinho, Ronald A. ;
Chin, Lynda ;
Garraway, Levi A. ;
Fuchs, Charles S. ;
Baselga, Jose ;
Tabernero, Josep ;
Gabriel, Stacey ;
Lander, Eric S. ;
Getz, Gad ;
Meyerson, Matthew .
NATURE GENETICS, 2011, 43 (10) :964-U67
[8]   State of art fusion-finder algorithms are suitable to detect transcription-induced chimeras in normal tissues? [J].
Carrara, Matteo ;
Beccuti, Marco ;
Cavallo, Federica ;
Donatelli, Susanna ;
Lazzarato, Fulvio ;
Cordero, Francesca ;
Calogero, Raffaele A. .
BMC BIOINFORMATICS, 2013, 14
[9]   State-of-the-Art Fusion-Finder Algorithms Sensitivity and Specificity [J].
Carrara, Matteo ;
Beccuti, Marco ;
Lazzarato, Fulvio ;
Cavallo, Federica ;
Cordero, Francesca ;
Donatelli, Susanna ;
Calogero, Raffaele A. .
BIOMED RESEARCH INTERNATIONAL, 2013, 2013
[10]   Biogenesis, identification, and function of exonic circular RNAs [J].
Chen, Iju ;
Chen, Chia-Ying ;
Chuang, Trees-Juen .
WILEY INTERDISCIPLINARY REVIEWS-RNA, 2015, 6 (05) :563-579