Predicting miRNA-disease associations using an ensemble learning framework with resampling method

Cited by: 37
Authors
Dai, Qiguo [1]
Wang, Zhaowei [2]
Liu, Ziqiang [1]
Duan, Xiaodong [1,2]
Song, Jinmiao [3]
Guo, Maozu [4]
Affiliations
[1] Dalian Minzu Univ, Sch Comp Sci & Engn, Dalian, Peoples R China
[2] Dalian Minzu Univ, Sch Comp Sci & Engn, Comp Sci & Technol, Dalian, Peoples R China
[3] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi, Peoples R China
[4] Beijing Univ Civil Engn & Architecture, Coll Elect & Informat Engn, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
miRNA-disease association; ensemble learning; resampling; feature selection; CANCER CELLS; MICRORNAS; PROLIFERATION;
DOI
10.1093/bib/bbab543
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Accumulating evidence indicates that microRNAs (miRNAs) play a crucial role in the pathogenesis and progression of various complex diseases. Inferring disease-associated miRNAs is important for exploring the etiology, diagnosis and treatment of human diseases. Because biological experiments are time-consuming and labor-intensive, effective computational methods have become indispensable for identifying associations between miRNAs and diseases.
Results: We present an Ensemble learning framework with Resampling method for MiRNA-Disease Association (ERMDA) prediction to discover potential disease-related miRNAs. First, a resampling strategy builds multiple different balanced training subsets to address the sample imbalance within the database. Then, ERMDA extracts miRNA and disease feature representations by integrating miRNA-miRNA similarities, disease-disease similarities and experimentally verified miRNA-disease association information. Next, a feature selection approach is applied to reduce redundant information and increase the diversity among these subsets. Lastly, ERMDA constructs an individual learner on each subset to yield preliminary outcomes, and a soft voting method makes the final decision based on the prediction results of the individual learners. A series of experiments demonstrates that ERMDA outperforms other state-of-the-art methods on both balanced and unbalanced testing sets. In addition, case studies on three human diseases further confirm ERMDA's capability for identifying potential disease-related miRNAs. In conclusion, these experimental results demonstrate that our method can serve as an effective and reliable tool for researchers to explore the regulatory role of miRNAs in complex diseases.
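The pipeline outlined in the abstract (balanced resampling, per-subset feature selection, one learner per subset, soft voting) can be illustrated with a minimal Python sketch. This is not the authors' implementation. Several pieces are assumptions made only for illustration: balancing is done by undersampling negatives, a random feature subset stands in for the paper's feature selection step, a nearest-centroid scorer stands in for the individual learner, and the toy three-dimensional data replaces the similarity-based miRNA and disease features. All function names are hypothetical.

```python
import math
import random


def balanced_subsets(positives, negatives, n_subsets, rng):
    """Build balanced subsets: all positives plus an equal-sized random
    draw of negatives (undersampling is an assumption of this sketch)."""
    return [positives + rng.sample(negatives, len(positives))
            for _ in range(n_subsets)]


def fit_centroid_learner(samples, feature_idx):
    """Toy base learner (a stand-in, not the paper's model): scores a sample
    by its distances to the class centroids on the chosen features."""
    def centroid(label):
        rows = [[x[i] for i in feature_idx] for x, y in samples if y == label]
        return [sum(col) / len(rows) for col in zip(*rows)]

    pos_c, neg_c = centroid(1), centroid(0)

    def predict_proba(x):
        v = [x[i] for i in feature_idx]
        # Closer to the positive centroid gives a score above 0.5.
        return 1.0 / (1.0 + math.exp(math.dist(v, pos_c) - math.dist(v, neg_c)))

    return predict_proba


def train_ensemble(positives, negatives, n_subsets=5, n_features=2, seed=0):
    """Train one learner per balanced subset, each on its own feature subset."""
    rng = random.Random(seed)
    n_total = len(positives[0][0])
    learners = []
    for subset in balanced_subsets(positives, negatives, n_subsets, rng):
        # Random feature subset: a placeholder for real feature selection.
        feature_idx = sorted(rng.sample(range(n_total), n_features))
        learners.append(fit_centroid_learner(subset, feature_idx))
    return learners


def soft_vote(learners, x):
    """Soft voting: average the probability-like scores of all learners."""
    return sum(learner(x) for learner in learners) / len(learners)


# Toy imbalanced data: 4 positives near (1, 1, 1), 20 negatives near (0, 0, 0).
data_rng = random.Random(42)


def noisy(center):
    return [center + data_rng.uniform(-0.2, 0.2) for _ in range(3)]


positives = [(noisy(1.0), 1) for _ in range(4)]
negatives = [(noisy(0.0), 0) for _ in range(20)]

learners = train_ensemble(positives, negatives)
score_hi = soft_vote(learners, [1.0, 1.0, 1.0])
score_lo = soft_vote(learners, [0.0, 0.0, 0.0])
print(score_hi > 0.5, score_lo < 0.5)
```

Averaging scores rather than counting hard votes lets confident learners outweigh uncertain ones, which is the usual motivation for soft voting in an ensemble of this kind.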
Pages: 12