Ontological function annotation of long noncoding RNAs through hierarchical multi-label classification

被引:50
作者
Zhang, Jingpu [1 ,2 ]
Zhang, Zuping [1 ]
Wang, Zixiang [3 ]
Liu, Yuting [3 ]
Deng, Lei [3 ,4 ]
机构
[1] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
[2] Ping Ding Shan Univ, Sch Comp Software, Pingdingshan 467000, Peoples R China
[3] Cent S Univ, Sch Software, Changsha 410075, Hunan, Peoples R China
[4] Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金;
关键词
GENE-EXPRESSION DATA; APOPTOSIS; DATABASE; PROTEIN; MICROARRAY; EVOLUTION; NETWORKS; PATTERNS; ELEMENTS; DISEASE;
D O I
10.1093/bioinformatics/btx833
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Long non-coding RNAs (lncRNAs) are an enormous collection of functional noncoding RNAs. Over the past decades, a large number of novel lncRNA genes have been identified. However, most of the lncRNAs remain function uncharacterized at present. Computational approaches provide a new insight to understand the potential functional implications of lncRNAs. Results: Considering that each lncRNA may have multiple functions and a function may be further specialized into sub-functions, here we describe NeuraNetL2GO, a computational ontological function prediction approach for lncRNAs using hierarchical multi-label classification strategy based on multiple neural networks. The neural networks are incrementally trained level by level, each performing the prediction of gene ontology (GO) terms belonging to a given level. In NeuraNetL2GO, we use topological features of the lncRNA similarity network as the input of the neural networks and employ the output results to annotate the lncRNAs. We show that NeuraNetL2GO achieves the best performance and the overall advantage in maximum F-measure and coverage on the manually annotated lncRNA2GO-55 dataset compared to other state-of-the-art methods.
引用
收藏
页码:1750 / 1757
页数:8
相关论文
共 50 条
[1]  
[Anonymous], 2015, P INT JOINT C NEUR N
[2]   NCBI GEO: mining tens of millions of expression profiles - database and tools update [J].
Barrett, Tanya ;
Troup, Dennis B. ;
Wilhite, Stephen E. ;
Ledoux, Pierre ;
Rudnev, Dmitry ;
Evangelista, Carlos ;
Kim, Irene F. ;
Soboleva, Alexandra ;
Tomashevsky, Maxim ;
Edgar, Ron .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D760-D765
[3]   Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[4]   Reduction strategies for hierarchical multi-label classification in protein function prediction [J].
Cerri, Ricardo ;
Barros, Rodrigo C. ;
de Carvalho, Andre C. P. L. F. ;
Jin, Yaochu .
BMC BIOINFORMATICS, 2016, 17
[5]   Hierarchical multi-label classification using local neural networks [J].
Cerri, Ricardo ;
Barros, Rodrigo C. ;
de Carvalho, Andre C. P. L. F. .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2014, 80 (01) :39-56
[6]   Diffusion Component Analysis: Unraveling Functional Topology in Biological Networks [J].
Cho, Hyunghoon ;
Berger, Bonnie ;
Peng, Jian .
RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY (RECOMB 2015), 2015, 9029 :62-64
[7]   Chromatin Isolation by RNA Purification (ChIRP) [J].
Chu, Ci ;
Quinn, Jeffrey ;
Chang, Howard Y. .
JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2012, (61)
[8]   An Integrated Framework for Functional Annotation of Protein Structural Domains [J].
Deng, Lei ;
Chen, Zhigang .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (04) :902-913
[9]   The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression [J].
Derrien, Thomas ;
Johnson, Rory ;
Bussotti, Giovanni ;
Tanzer, Andrea ;
Djebali, Sarah ;
Tilgner, Hagen ;
Guernec, Gregory ;
Martin, David ;
Merkel, Angelika ;
Knowles, David G. ;
Lagarde, Julien ;
Veeravalli, Lavanya ;
Ruan, Xiaoan ;
Ruan, Yijun ;
Lassmann, Timo ;
Carninci, Piero ;
Brown, James B. ;
Lipovich, Leonard ;
Gonzalez, Jose M. ;
Thomas, Mark ;
Davis, Carrie A. ;
Shiekhattar, Ramin ;
Gingeras, Thomas R. ;
Hubbard, Tim J. ;
Notredame, Cedric ;
Harrow, Jennifer ;
Guigo, Roderic .
GENOME RESEARCH, 2012, 22 (09) :1775-1789
[10]   The lincRNA HOTAIRM1, located in the HOXA genomic region, is expressed in acute myeloid leukemia, impacts prognosis in patients in the intermediate-risk cytogenetic category, and is associated with a distinctive microRNA signature [J].
Diaz-Beya, Marina ;
Brunet, Salut ;
Nomdedeu, Josep ;
Pratcorona, Marta ;
Cordeiro, Anna ;
Gallardo, David ;
Escoda, Lourdes ;
Tormo, Mar ;
Heras, Inmaculada ;
Maria Ribera, Josep ;
Duarte, Rafael ;
Queipo de Llano, Maria Paz ;
Bargay, Joan ;
Sampol, Antonia ;
Nomdedeu, Meritxell ;
Risueno, Ruth M. ;
Hoyos, Montserrat ;
Sierra, Jorge ;
Monzo, Mariano ;
Navarro, Alfons ;
Esteve, Jordi .
ONCOTARGET, 2015, 6 (31) :31613-31627