EGBMMDA: Extreme Gradient Boosting Machine for MiRNA-Disease Association prediction

被引:233
作者
Chen, Xing [1 ]
Huang, Li [2 ]
Xie, Di [3 ]
Zhao, Qi [4 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Peoples R China
[2] Natl Univ Singapore, Business Analyt Ctr, Singapore 119613, Singapore
[3] Liaoning Univ, Sch Math, Shenyang 110036, Liaoning, Peoples R China
[4] Res Ctr Comp Simulating & Informat Proc Biomacrom, Shenyang 110036, Liaoning, Peoples R China
来源
CELL DEATH & DISEASE | 2018年 / 9卷
基金
中国国家自然科学基金;
关键词
HUMAN MICRORNA; TUMOR-SUPPRESSOR; EXPRESSION; CANCER; IDENTIFICATION; RECEPTOR; GENES; LYMPHOMA; DATABASE; TARGETS;
D O I
10.1038/s41419-017-0003-x
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Associations between microRNAs (miRNAs) and human diseases have been identified by increasing studies and discovering new ones is an ongoing process in medical laboratories. To improve experiment productivity, researchers computationally infer potential associations from biological data, selecting the most promising candidates for experimental verification. Predicting potential miRNA-disease association has become a research area of growing importance. This paper presents a model of Extreme Gradient Boosting Machine for MiRNA-Disease Association (EGBMMDA) prediction by integrating the miRNA functional similarity, the disease semantic similarity, and known miRNA-disease associations. The statistical measures, graph theoretical measures, and matrix factorization results for each miRNA-disease pair were calculated and used to form an informative feature vector. The vector for known associated pairs obtained from the HMDD v2.0 database was used to train a regression tree under the gradient boosting framework. EGBMMDA was the first decision tree learning-based model used for predicting miRNA-disease associations. Respectively, AUCs of 0.9123 and 0.8221 in global and local leave-one-out cross-validation proved the model's reliable performance. Moreover, the 0.9048 +/- 0.0012 AUC in fivefold cross-validation confirmed its stability. We carried out three different types of case studies of predicting potential miRNAs related to Colon Neoplasms, Lymphoma, Prostate Neoplasms, Breast Neoplasms, and Esophageal Neoplasms. The results indicated that, respectively, 98%, 90%, 98%, 100%, and 98% of the top 50 predictions for the five diseases were confirmed by experiments. Therefore, EGBMMDA appears to be a useful computational resource for miRNA-disease association prediction.
引用
收藏
页数:16
相关论文
共 59 条
[1]   Gene prioritization through genomic data fusion [J].
Aerts, S ;
Lambrechts, D ;
Maity, S ;
Van Loo, P ;
Coessens, B ;
De Smet, F ;
Tranchevent, LC ;
De Moor, B ;
Marynen, P ;
Hassan, B ;
Carmeliet, P ;
Moreau, Y .
NATURE BIOTECHNOLOGY, 2006, 24 (05) :537-544
[2]   The functions of animal microRNAs [J].
Ambros, V .
NATURE, 2004, 431 (7006) :350-355
[3]   MicroRNAs: Target Recognition and Regulatory Functions [J].
Bartel, David P. .
CELL, 2009, 136 (02) :215-233
[4]   MicroRNA-200 is commonly repressed in conjunctival MALT lymphoma, and targets cyclin E2 [J].
Cai, Jiping ;
Liu, Xiaoyu ;
Cheng, Jinwei ;
Li, You ;
Huang, Xiao ;
Li, Yuzhen ;
Ma, Xiaoye ;
Yu, Hongyu ;
Liu, Huimin ;
Wei, Ruili .
GRAEFES ARCHIVE FOR CLINICAL AND EXPERIMENTAL OPHTHALMOLOGY, 2012, 250 (04) :523-531
[5]   Frequent deletions and down-regulation of micro-RNA genes miR15 and miR16 at 13q14 in chronic lymphocytic leukemia [J].
Calin, GA ;
Dumitru, CD ;
Shimizu, M ;
Bichi, R ;
Zupo, S ;
Noch, E ;
Aldler, H ;
Rattan, S ;
Keating, M ;
Rai, K ;
Rassenti, L ;
Kipps, T ;
Negrini, M ;
Bullrich, F ;
Croce, CM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (24) :15524-15529
[6]   MicroRNA signatures in human cancers [J].
Calin, George A. ;
Croce, Carlo M. .
NATURE REVIEWS CANCER, 2006, 6 (11) :857-866
[7]   MicroRNA-101 (miR-101) post-transcriptionally regulates the expression of EP4 receptor in colon cancers [J].
Chandramouli, Anupama ;
Onyeagucha, Benjamin Chidi ;
Mercado-Pimentel, Melania E. ;
Stankova, Lenka ;
Abu Shahin, Nisreen ;
LaFleur, Bonnie J. ;
Heimark, Ronald L. ;
Bhattacharyya, Achyut K. ;
Nelson, Mark A. .
CANCER BIOLOGY & THERAPY, 2012, 13 (03) :175-183
[8]  
Chen T., 2015, NIPS 2014 WORKSH HIG, P69
[9]  
Chen T, 2016, ABS160302754 CORR
[10]   HGIMDA: Heterogeneous graph inference for miRNA-disease association prediction [J].
Chen, Xing ;
Yan, Chenggang Clarence ;
Zhang, Xu ;
You, Zhu-Hong ;
Huang, Yu-An ;
Yan, Gui-Ying .
ONCOTARGET, 2016, 7 (40) :65257-65269