iSeeRNA: identification of long intergenic non-coding RNA transcripts from transcriptome sequencing data

被引:123
作者
Sun, Kun [1 ,2 ]
Chen, Xiaona [1 ,3 ]
Jiang, Peiyong [1 ,2 ]
Song, Xiaofeng [4 ]
Wang, Huating [1 ,3 ]
Sun, Hao [1 ,2 ]
机构
[1] Chinese Univ Hong Kong, Li Ka Shing Inst Hlth Sci, Shatin, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Dept Chem Pathol, Shatin, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Dept Obstet & Gynaecol, Shatin, Hong Kong, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Dept Biomed Engn, Nanjing 210016, Peoples R China
基金
中国国家自然科学基金;
关键词
VERTEBRATE;
D O I
10.1186/1471-2164-14-S2-S7
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Long intergenic non-coding RNAs (lincRNAs) are emerging as a novel class of non-coding RNAs and potent gene regulators. High-throughput RNA-sequencing combined with de novo assembly promises quantity discovery of novel transcripts. However, the identification of lincRNAs from thousands of assembled transcripts is still challenging due to the difficulties of separating them from protein coding transcripts (PCTs). Results: We have implemented iSeeRNA, a support vector machine (SVM)-based classifier for the identification of lincRNAs. iSeeRNA shows better performance compared to other software. A public available webserver for iSeeRNA is also provided for small size dataset. Conclusions: iSeeRNA demonstrates high prediction accuracy and runs several magnitudes faster than other similar programs. It can be integrated into the transcriptome data analysis pipelines or run as a web server, thus offering a valuable tool for lincRNA study.
引用
收藏
页数:10
相关论文
共 27 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]   Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus Paracoccidioides brasiliensis Software [J].
Arrial, Roberto T. ;
Togawa, Roberto C. ;
Brigido, Marcelo de M. .
BMC BIOINFORMATICS, 2009, 10
[3]  
Byvatov Evgeny, 2003, Appl Bioinformatics, V2, P67
[4]  
Cabili MN, GENES DEV, V25, P1915
[5]   The transcriptional landscape of the mammalian genome [J].
Carninci, P ;
Kasukawa, T ;
Katayama, S ;
Gough, J ;
Frith, MC ;
Maeda, N ;
Oyama, R ;
Ravasi, T ;
Lenhard, B ;
Wells, C ;
Kodzius, R ;
Shimokawa, K ;
Bajic, VB ;
Brenner, SE ;
Batalov, S ;
Forrest, ARR ;
Zavolan, M ;
Davis, MJ ;
Wilming, LG ;
Aidinis, V ;
Allen, JE ;
Ambesi-Impiombato, X ;
Apweiler, R ;
Aturaliya, RN ;
Bailey, TL ;
Bansal, M ;
Baxter, L ;
Beisel, KW ;
Bersano, T ;
Bono, H ;
Chalk, AM ;
Chiu, KP ;
Choudhary, V ;
Christoffels, A ;
Clutterbuck, DR ;
Crowe, ML ;
Dalla, E ;
Dalrymple, BP ;
de Bono, B ;
Della Gatta, G ;
di Bernardo, D ;
Down, T ;
Engstrom, P ;
Fagiolini, M ;
Faulkner, G ;
Fletcher, CF ;
Fukushima, T ;
Furuno, M ;
Futaki, S ;
Gariboldi, M .
SCIENCE, 2005, 309 (5740) :1559-1563
[6]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[7]   Distinguishing protein-coding and noncoding genes in the human genome [J].
Clamp, Michele ;
Fry, Ben ;
Kamal, Mike ;
Xie, Xiaohui ;
Cuff, James ;
Lin, Michael F. ;
Kellis, Manolis ;
Lindblad-Toh, Kerstin ;
Lander, Eric S. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (49) :19428-19433
[8]   Differentiating Protein-Coding and Noncoding RNA: Challenges and Ambiguities [J].
Dinger, Marcel E. ;
Pang, Ken C. ;
Mercer, Tim R. ;
Mattick, John S. .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (11)
[9]   Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs [J].
Guttman, Mitchell ;
Garber, Manuel ;
Levin, Joshua Z. ;
Donaghey, Julie ;
Robinson, James ;
Adiconis, Xian ;
Fan, Lin ;
Koziol, Magdalena J. ;
Gnirke, Andreas ;
Nusbaum, Chad ;
Rinn, John L. ;
Lander, Eric S. ;
Regev, Aviv .
NATURE BIOTECHNOLOGY, 2010, 28 (05) :503-U166
[10]   Genome-wide computational identification and manual annotation of human long noncoding RNA genes [J].
Jia, Hui ;
Osak, Maureen ;
Bogu, Gireesh K. ;
Stanton, Lawrence W. ;
Johnson, Rory ;
Lipovich, Leonard .
RNA, 2010, 16 (08) :1478-1487