Identification and function annotation of long intervening noncoding RNAs

被引:27
作者
Luo, Haitao [1 ,3 ]
Bu, Dechao [1 ]
Sun, Liang [1 ]
Fang, Shuangsang [1 ]
Liu, Zhiyong [4 ]
Zhao, Yi [1 ,2 ]
机构
[1] Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Precis Med Ctr, Res Inst Informat Ind LuoYang, LuoYang Branch,Inst Comp Technol, Luoyang 471000, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
[4] Chinese Acad Sci, Inst Comp Technol, Adv Comp Res Ctr, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
transcriptom sequencing; noncoding RNAs; gene identification; function annotation; network analysis; computational method; SECONDARY STRUCTURE; EXPRESSION ANALYSIS; SEQ; TRANSCRIPTOME; PREDICTION; REVEALS; RECONSTRUCTION; TOPHAT; GENE; DNA;
D O I
10.1093/bib/bbw046
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
RNA-seq technology offers the promise of rapid comprehensive discovery of long intervening noncoding RNAs (lincRNAs). Basic tools such as Tophat and Cufflinks have been widely used for RNA-seq assembly. However, advanced bioinformatics methodologies that allow in-depth analysis of lincRNAs are lacking. Here, we describe a computational protocol that is especially designed for the identification of novel lincRNAs and the prediction of the function. The protocol mainly includes two open-access tools, CNCI and ncFANs. CNCI allows users to distinguish noncoding from protein-coding transcripts and to retrieve novel lincRNAs. ncFANs integrates expression profiles of protein-coding and lincRNA genes to construct co-expression networks. Such networks are subsequently used to perform function predictions of unknown lincRNAs. This protocol will allow users to apply these procedures without the need of additional training. All the tools in current protocol are available http://www.bioinfo.org/np/.
引用
收藏
页码:789 / 797
页数:9
相关论文
共 41 条
[1]   Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning [J].
Alipanahi, Babak ;
Delong, Andrew ;
Weirauch, Matthew T. ;
Frey, Brendan J. .
NATURE BIOTECHNOLOGY, 2015, 33 (08) :831-+
[2]   Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[3]   Count-based differential expression analysis of RNA sequencing data using R and Bioconductor [J].
Anders, Simon ;
McCarthy, Davis J. ;
Chen, Yunshun ;
Okoniewski, Michal ;
Smyth, Gordon K. ;
Huber, Wolfgang ;
Robinson, Mark D. .
NATURE PROTOCOLS, 2013, 8 (09) :1765-1786
[4]   RNAstructure: web servers for RNA secondary structure prediction and analysis [J].
Bellaousov, Stanislav ;
Reuter, Jessica S. ;
Seetin, Matthew G. ;
Mathews, David H. .
NUCLEIC ACIDS RESEARCH, 2013, 41 (W1) :W471-W474
[5]   Predicting protein associations with long noncoding RNAs [J].
Bellucci, Matteo ;
Agostini, Federico ;
Masin, Marianela ;
Tartaglia, Gian Gaetano .
NATURE METHODS, 2011, 8 (06) :444-445
[6]   Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses [J].
Cabili, Moran N. ;
Trapnell, Cole ;
Goff, Loyal ;
Koziol, Magdalena ;
Tazon-Vega, Barbara ;
Regev, Aviv ;
Rinn, John L. .
GENES & DEVELOPMENT, 2011, 25 (18) :1915-1927
[7]   STAR: ultrafast universal RNA-seq aligner [J].
Dobin, Alexander ;
Davis, Carrie A. ;
Schlesinger, Felix ;
Drenkow, Jorg ;
Zaleski, Chris ;
Jha, Sonali ;
Batut, Philippe ;
Chaisson, Mark ;
Gingeras, Thomas R. .
BIOINFORMATICS, 2013, 29 (01) :15-21
[8]   An integrated encyclopedia of DNA elements in the human genome [J].
Dunham, Ian ;
Kundaje, Anshul ;
Aldred, Shelley F. ;
Collins, Patrick J. ;
Davis, CarrieA. ;
Doyle, Francis ;
Epstein, Charles B. ;
Frietze, Seth ;
Harrow, Jennifer ;
Kaul, Rajinder ;
Khatun, Jainab ;
Lajoie, Bryan R. ;
Landt, Stephen G. ;
Lee, Bum-Kyu ;
Pauli, Florencia ;
Rosenbloom, Kate R. ;
Sabo, Peter ;
Safi, Alexias ;
Sanyal, Amartya ;
Shoresh, Noam ;
Simon, Jeremy M. ;
Song, Lingyun ;
Trinklein, Nathan D. ;
Altshuler, Robert C. ;
Birney, Ewan ;
Brown, James B. ;
Cheng, Chao ;
Djebali, Sarah ;
Dong, Xianjun ;
Dunham, Ian ;
Ernst, Jason ;
Furey, Terrence S. ;
Gerstein, Mark ;
Giardine, Belinda ;
Greven, Melissa ;
Hardison, Ross C. ;
Harris, Robert S. ;
Herrero, Javier ;
Hoffman, Michael M. ;
Iyer, Sowmya ;
Kellis, Manolis ;
Khatun, Jainab ;
Kheradpour, Pouya ;
Kundaje, Anshul ;
Lassmann, Timo ;
Li, Qunhua ;
Lin, Xinying ;
Marinov, Georgi K. ;
Merkel, Angelika ;
Mortazavi, Ali .
NATURE, 2012, 489 (7414) :57-74
[9]   The Xist lncRNA Exploits Three-Dimensional Genome Architecture to Spread Across the X Chromosome [J].
Engreitz, Jesse M. ;
Pandya-Jones, Amy ;
McDonel, Patrick ;
Shishkin, Alexander ;
Sirokman, Klara ;
Surka, Christine ;
Kadri, Sabah ;
Xing, Jeffrey ;
Goren, Alon ;
Lander, Eric S. ;
Plath, Kathrin ;
Guttman, Mitchell .
SCIENCE, 2013, 341 (6147) :767-+
[10]   Ballgown bridges the gap between transcriptome assembly and expression analysis [J].
Frazee, Alyssa C. ;
Pertea, Geo ;
Jaffe, Andrew E. ;
Langmead, Ben ;
Salzberg, Steven L. ;
Leek, Jeffrey T. .
NATURE BIOTECHNOLOGY, 2015, 33 (03) :243-246