KATZLGO: Large-Scale Prediction of LncRNA Functions by Using the KATZ Measure Based on Multiple Networks

被引:681
作者
Zhang, Zuping [1 ]
Zhang, Jingpu [1 ]
Fan, Chao [2 ]
Tang, Yongjun [3 ]
Deng, Lei [2 ,4 ]
机构
[1] Cent South Univ, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
[2] Cent South Univ, Sch Software, Changsha 410075, Hunan, Peoples R China
[3] Cent South Univ, Dept Clin Pharmacol, Dept Pediat, Xiangya Hosp, Changsha 410008, Hunan, Peoples R China
[4] Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
基金
高等学校博士学科点专项科研基金; 中国国家自然科学基金; 中国博士后科学基金;
关键词
Katz measure; lncRNA function; multiple networks; LONG NONCODING RNAS; EXPRESSION DATA; LOCI; DATABASE; ANNOTATION; MICROARRAY; EVOLUTION; REVEALS; CCAT2; 8Q24;
D O I
10.1109/TCBB.2017.2704587
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Aggregating evidences have shown that long non-coding RNAs (lncRNAs) generally play key roles in cellular biological processes such as epigenetic regulation, gene expression regulation at transcriptional and post-transcriptional levels, cell differentiation, and others. However, most lncRNAs have not been functionally characterized. There is an urgent need to develop computational approaches for function annotation of increasing available lncRNAs. In this article, we propose a global network-based method, KATZLGO, to predict the functions of human lncRNAs at large scale. A global network is constructed by integrating three heterogeneous networks: lncRNA-lncRNA similarity network, lncRNA-protein association network, and protein-protein interaction network. The KATZ measure is then employed to calculate similarities between lncRNAs and proteins in the global network. We annotate lncRNAs with Gene Ontology (GO) terms of their neighboring protein-coding genes based on the KATZ similarity scores. The performance of KATZLGO is evaluated on a manually annotated lncRNA benchmark and a protein-coding gene benchmark with known function annotations. KATZLGO significantly outperforms state-of-the-art computational method both in maximum F-measure and coverage. Furthermore, we apply KATZLGO to predict functions of human lncRNAs and successfully map 12,318 human lncRNA genes to GO terms.
引用
收藏
页码:407 / 416
页数:10
相关论文
共 46 条
[31]   Long non-coding RNAs: insights into functions [J].
Mercer, Tim R. ;
Dinger, Marcel E. ;
Mattick, John S. .
NATURE REVIEWS GENETICS, 2009, 10 (03) :155-159
[32]   COXPRESdb in 2015: coexpression database for animal species by DNA-microarray and RNAseq-based expression data with multiple quality assessment systems [J].
Okamura, Yasunobu ;
Aoki, Yuichi ;
Obayashi, Takeshi ;
Tadaka, Shu ;
Ito, Satoshi ;
Narise, Takafumi ;
Kinoshita, Kengo .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D82-D86
[33]   Identification of Genetic Susceptibility Loci for Colorectal Tumors in a Genome-Wide Meta-analysis [J].
Peters, Ulrike ;
Jiao, Shuo ;
Schumacher, Fredrick R. ;
Hutter, Carolyn M. ;
Aragaki, Aaron K. ;
Baron, John A. ;
Berndt, Sonja I. ;
Bezieau, Stephane ;
Brenner, Hermann ;
Butterbach, Katja ;
Caan, Bette J. ;
Campbell, Peter T. ;
Carlson, Christopher S. ;
Casey, Graham ;
Chan, Andrew T. ;
Chang-Claude, Jenny ;
Chanock, Stephen J. ;
Chen, Lin S. ;
Coetzee, Gerhard A. ;
Coetzee, Simon G. ;
Conti, David V. ;
Curtis, Keith R. ;
Duggan, David ;
Edwards, Todd ;
Fuchs, Charles S. ;
Gallinger, Steven ;
Giovannucci, Edward L. ;
Gogarten, Stephanie M. ;
Gruber, Stephen B. ;
Haile, Robert W. ;
Harrison, Tabitha A. ;
Hayes, Richard B. ;
Henderson, Brian E. ;
Hoffmeister, Michael ;
Hopper, John L. ;
Hudson, Thomas J. ;
Hunter, David J. ;
Jackson, Rebecca D. ;
Jee, Sun Ha ;
Jenkins, Mark A. ;
Jia, Wei-Hua ;
Kolonel, Laurence N. ;
Kooperberg, Charles ;
Kuery, Sebastien ;
Lacroix, Andrea Z. ;
Laurie, Cathy C. ;
Laurie, Cecelia A. ;
Le Marchand, Loic ;
Lemire, Mathieu ;
Levine, David .
GASTROENTEROLOGY, 2013, 144 (04) :799-+
[34]   Template-based prediction of protein function [J].
Petrey, Donald ;
Chen, T. Scott ;
Deng, Lei ;
Garzon, Jose Ignacio ;
Hwang, Howook ;
Lasso, Gorka ;
Lee, Hunjoong ;
Silkov, Antonina ;
Honig, Barry .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2015, 32 :33-38
[35]   Evolution and Functions of Long Noncoding RNAs [J].
Ponting, Chris P. ;
Oliver, Peter L. ;
Reik, Wolf .
CELL, 2009, 136 (04) :629-641
[36]   CCAT2 is a lung adenocarcinoma-specific long non-coding RNA and promotes invasion of non-small cell lung cancer [J].
Qiu, Mantang ;
Xu, Youtao ;
Yang, Xin ;
Wang, Jie ;
Hu, Jingwen ;
Xu, Lin ;
Yin, Rong .
TUMOR BIOLOGY, 2014, 35 (06) :5375-5380
[37]  
Radivojac P, 2013, NAT METHODS, V10, P221, DOI [10.1038/nmeth.2340, 10.1038/NMETH.2340]
[38]   Functional demarcation of active and silent chromatin domains in human HOX loci by Noncoding RNAs [J].
Rinn, John L. ;
Kertesz, Michael ;
Wang, Jordon K. ;
Squazzo, Sharon L. ;
Xu, Xiao ;
Brugmann, Samantha A. ;
Goodnough, L. Henry ;
Helms, Jill A. ;
Farnham, Peggy J. ;
Segal, Eran ;
Chang, Howard Y. .
CELL, 2007, 129 (07) :1311-1323
[39]   Prediction and Validation of Gene-Disease Associations Using Methods Inspired by Social Network Analyses [J].
Singh-Blom, U. Martin ;
Natarajan, Nagarajan ;
Tewari, Ambuj ;
Woods, John O. ;
Dhillon, Inderjit S. ;
Marcotte, Edward M. .
PLOS ONE, 2013, 8 (05)
[40]   STRING v10: protein-protein interaction networks, integrated over the tree of life [J].
Szklarczyk, Damian ;
Franceschini, Andrea ;
Wyder, Stefan ;
Forslund, Kristoffer ;
Heller, Davide ;
Huerta-Cepas, Jaime ;
Simonovic, Milan ;
Roth, Alexander ;
Santos, Alberto ;
Tsafou, Kalliopi P. ;
Kuhn, Michael ;
Bork, Peer ;
Jensen, Lars J. ;
von Mering, Christian .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D447-D452