Predicting miRNA-disease associations using a hybrid feature representation in the heterogeneous network

被引:10
作者
Liu, Minghui [1 ]
Yang, Jingyi [1 ]
Wang, Jiacheng [1 ]
Deng, Lei [1 ,2 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha 410075, Peoples R China
[2] Xinjiang Univ, Sch Software, Urumqi 830008, Peoples R China
基金
中国国家自然科学基金;
关键词
miRNA-disease association; HeteSim measure; Diffusion feature; eXtreme gradient boosting; MICRORNA; HETESIM;
D O I
10.1186/s12920-020-00783-0
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
BackgroundStudies have found that miRNAs play an important role in many biological activities involved in human diseases. Revealing the associations between miRNA and disease by biological experiments is time-consuming and expensive. The computational approaches provide a new alternative. However, because of the limited knowledge of the associations between miRNAs and diseases, it is difficult to support the prediction model effectively.MethodsIn this work, we propose a model to predict miRNA-disease associations, MDAPCOM, in which protein information associated with miRNAs and diseases is introduced to build a global miRNA-protein-disease network. Subsequently, diffusion features and HeteSim features, extracted from the global network, are combined to train the prediction model by eXtreme Gradient Boosting (XGBoost).ResultsThe MDAPCOM model achieves AUC of 0.991 based on 10-fold cross-validation, which is significantly better than that of other two state-of-the-art methods RWRMDA and PRINCE. Furthermore, the model performs well on three unbalanced data sets.ConclusionsThe results suggest that the information behind proteins associated with miRNAs and diseases is crucial to the prediction of the associations between miRNAs and diseases, and the hybrid feature representation in the heterogeneous network is very effective for improving predictive performance.
引用
收藏
页数:11
相关论文
共 38 条
[1]   The functions of animal microRNAs [J].
Ambros, V .
NATURE, 2004, 431 (7006) :350-355
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[4]   Frequent deletions and down-regulation of micro-RNA genes miR15 and miR16 at 13q14 in chronic lymphocytic leukemia [J].
Calin, GA ;
Dumitru, CD ;
Shimizu, M ;
Bichi, R ;
Zupo, S ;
Noch, E ;
Aldler, H ;
Rattan, S ;
Keating, M ;
Rai, K ;
Rassenti, L ;
Kipps, T ;
Negrini, M ;
Bullrich, F ;
Croce, CM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (24) :15524-15529
[5]  
Chen W, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE RCAR), P22, DOI 10.1109/RCAR.2016.7783995
[6]   EGBMMDA: Extreme Gradient Boosting Machine for MiRNA-Disease Association prediction [J].
Chen, Xing ;
Huang, Li ;
Xie, Di ;
Zhao, Qi .
CELL DEATH & DISEASE, 2018, 9
[7]   Semi-supervised learning for potential human microRNA-disease associations inference [J].
Chen, Xing ;
Yan, Gui-Ying .
SCIENTIFIC REPORTS, 2014, 4
[8]   RWRMDA: predicting novel human microRNA-disease associations [J].
Chen, Xing ;
Liu, Ming-Xi ;
Yan, Gui-Ying .
MOLECULAR BIOSYSTEMS, 2012, 8 (10) :2792-2798
[9]   miRTarBase update 2018: a resource for experimentally validated microRNA-target interactions [J].
Chou, Chih-Hung ;
Shrestha, Sirjana ;
Yang, Chi-Dung ;
Chang, Nai-Wen ;
Lin, Yu-Ling ;
Liao, Kuang-Wen ;
Huang, Wei-Chi ;
Sun, Ting-Hsuan ;
Tu, Siang-Jyun ;
Lee, Wei-Hsiang ;
Chiew, Men-Yee ;
Tai, Chun-San ;
Wei, Ting-Yen ;
Tsai, Tzi-Ren ;
Huang, Hsin-Tzu ;
Wang, Chung-Yu ;
Wu, Hsin-Yi ;
Ho, Shu-Yi ;
Chen, Pin-Rong ;
Chuang, Cheng-Hsun ;
Hsieh, Pei-Jung ;
Wu, Yi-Shin ;
Chen, Wen-Liang ;
Li, Meng-Ju ;
Wu, Yu-Chun ;
Huang, Xin-Yi ;
Ng, Fung Ling ;
Buddhakosai, Waradee ;
Huang, Pei-Chun ;
Lan, Kuan-Chun ;
Huang, Chia-Yen ;
Weng, Shun-Long ;
Cheng, Yeong-Nan ;
Liang, Chao ;
Hsu, Wen-Lian ;
Huang, Hsien-Da .
NUCLEIC ACIDS RESEARCH, 2018, 46 (D1) :D296-D302
[10]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232