Prediction of lncRNA-disease associations based on inductive matrix completion

被引:245
作者
Lu, Chengqian [1 ]
Yang, Mengyun [1 ]
Luo, Feng [2 ]
Wu, Fang-Xiang [3 ]
Li, Min [1 ]
Pan, Yi [4 ]
Li, Yaohang [5 ]
Wang, Jianxin [1 ]
机构
[1] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Hunan, Peoples R China
[2] Clemson Univ, Sch Comp, Clemson, SC 29634 USA
[3] Univ Saskatchewan, Div Biomed Engn, Saskatoon, SK S7N 5A9, Canada
[4] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30302 USA
[5] Old Dominion Univ, Dept Comp Sci, Norfolk, VA 23529 USA
基金
中国国家自然科学基金;
关键词
RENAL-CELL CARCINOMA; NONCODING RNA; EXPRESSION; SIMILARITY; GENES; MODEL;
D O I
10.1093/bioinformatics/bty327
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Accumulating evidences indicate that long non-coding RNAs (lncRNAs) play pivotal roles in various biological processes. Mutations and dysregulations of lncRNAs are implicated in miscellaneous human diseases. Predicting lncRNA-disease associations is beneficial to disease diagnosis as well as treatment. Although many computational methods have been developed, precisely identifying lncRNA-disease associations, especially for novel lncRNAs, remains challenging. Results: In this study, we propose a method (named SIMCLDA) for predicting potential lncRNA-disease associations based on inductive matrix completion. We compute Gaussian interaction profile kernel of lncRNAs from known lncRNA-disease interactions and functional similarity of diseases based on disease-gene and gene-gene onotology associations. Then, we extract primary feature vectors from Gaussian interaction profile kernel of lncRNAs and functional similarity of diseases by principal component analysis, respectively. For a new lncRNA, we calculate the interaction profile according to the interaction profiles of its neighbors. At last, we complete the association matrix based on the inductive matrix completion framework using the primary feature vectors from the constructed feature matrices. Computational results show that SIMCLDA can effectively predict lncRNA-disease associations with higher accuracy compared with previous methods. Furthermore, case studies show that SIMCLDA can effectively predict candidate lncRNAs for renal cancer, gastric cancer and prostate cancer. Availability and implementation: https://github.com//bioinfomaticsCSU/SIMCLDA
引用
收藏
页码:3357 / 3364
页数:8
相关论文
共 49 条
[1]   The Ensembl gene annotation system [J].
Aken, Bronwen L. ;
Ayling, Sarah ;
Barrell, Daniel ;
Clarke, Laura ;
Curwen, Valery ;
Fairley, Susan ;
Banet, Julio Fernandez ;
Billis, Konstantinos ;
Giron, Carlos Garcia ;
Hourlier, Thibaut ;
Howe, Kevin ;
Kahari, Andreas ;
Kokocinski, Felix ;
Martin, Fergal J. ;
Murphy, Daniel N. ;
Nag, Rishi ;
Ruffier, Magali ;
Schuster, Michael ;
Tang, Y. Amy ;
Vogel, Jan-Hinnerk ;
White, Simon ;
Zadissa, Amonida ;
Flicek, Paul ;
Searle, Stephen M. J. .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,
[2]  
[Anonymous], 2002, THESIS STANFORD U
[3]  
[Anonymous], 2014, INT C MACH LEARN
[4]  
[Anonymous], 2013, Advances in Neural Information Processing Systems
[5]  
[Anonymous], 1908, Bull. Soc. Vaud. Sci. Nat.
[6]   A SINGULAR VALUE THRESHOLDING ALGORITHM FOR MATRIX COMPLETION [J].
Cai, Jian-Feng ;
Candes, Emmanuel J. ;
Shen, Zuowei .
SIAM JOURNAL ON OPTIMIZATION, 2010, 20 (04) :1956-1982
[7]   LncRNADisease: a database for long-non-coding RNA-associated diseases [J].
Chen, Geng ;
Wang, Ziyun ;
Wang, Dongqing ;
Qiu, Chengxiang ;
Liu, Mingxi ;
Chen, Xing ;
Zhang, Qipeng ;
Yan, Guiying ;
Cui, Qinghua .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D983-D986
[8]  
Chen W, 1997, J PATHOL, V183, P345, DOI 10.1002/(SICI)1096-9896(199711)183:3<345::AID-PATH930>3.0.CO
[9]  
2-8
[10]   IRWRLDA: improved random walk with restart for lncRNA-disease association prediction [J].
Chen, Xing ;
You, Zhu-Hong ;
Yan, Gui-Ying ;
Gong, Dun-Wei .
ONCOTARGET, 2016, 7 (36) :57919-57931