ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations

被引:8
作者
Chen, Xue-Jun [1 ]
Hua, Xin-Yun [1 ]
Jiang, Zhen-Ran [1 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200062, Peoples R China
基金
国家重点研发计划;
关键词
miRNA-disease association; k-means; Noise smoothing; Light gradient boosting machine; SIMILARITY; EXPRESSION; MICRORNAS;
D O I
10.1186/s12859-021-04266-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background A growing proportion of research has proved that microRNAs (miRNAs) can regulate the function of target genes and have close relations with various diseases. Developing computational methods to exploit more potential miRNA-disease associations can provide clues for further functional research. Results Inspired by the work of predecessors, we discover that the noise hiding in the data can affect the prediction performance and then propose an anti-noise algorithm (ANMDA) to predict potential miRNA-disease associations. Firstly, we calculate the similarity in miRNAs and diseases to construct features and obtain positive samples according to the Human MicroRNA Disease Database version 2.0 (HMDD v2.0). Then, we apply k-means on the undetected miRNA-disease associations and sample the negative examples equally from the k-cluster. Further, we construct several data subsets through sampling with replacement to feed on the light gradient boosting machine (LightGBM) method. Finally, the voting method is applied to predict potential miRNA-disease relationships. As a result, ANMDA can achieve an area under the receiver operating characteristic curve (AUROC) of 0.9373 +/- 0.0005 in five-fold cross-validation, which is superior to several published methods. In addition, we analyze the predicted miRNA-disease associations with high probability and compare them with the data in HMDD v3.0 in the case study. The results show ANMDA is a novel and practical algorithm that can be used to infer potential miRNA-disease associations. Conclusion The results indicate the noise hiding in the data has an obvious impact on predicting potential miRNA-disease associations. We believe ANMDA can achieve better results from this task with more methods used in dealing with the data noise.
引用
收藏
页数:15
相关论文
共 37 条
[1]  
Chen X., 2021, BRIEF BIOINFORM, V22, pBBAA186, DOI [10.1093/bib/bbaa186, DOI 10.1093/BIB/BBAA186]
[2]   MicroRNAs and complex diseases: from experimental results to computational models [J].
Chen, Xing ;
Xie, Di ;
Zhao, Qi ;
You, Zhu-Hong .
BRIEFINGS IN BIOINFORMATICS, 2019, 20 (02) :515-539
[3]   RKNNMDA: Ranking-based KNN for MiRNA-Disease Association prediction [J].
Chen, Xing ;
Wu, Qiao-Feng ;
Yan, Gui-Ying .
RNA BIOLOGY, 2017, 14 (07) :952-962
[4]   WBSMDA: Within and Between Score for MiRNA-Disease Association prediction [J].
Chen, Xing ;
Yan, Chenggang Clarence ;
Zhang, Xu ;
You, Zhu-Hong ;
Deng, Lixi ;
Liu, Ying ;
Zhang, Yongdong ;
Dai, Qionghai .
SCIENTIFIC REPORTS, 2016, 6
[5]   Novel human lncRNA-disease association inference based on lncRNA expression profiles [J].
Chen, Xing ;
Yan, Gui-Ying .
BIOINFORMATICS, 2013, 29 (20) :2617-2624
[6]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232
[7]   PMAMCA: prediction of microRNA-disease association utilizing a matrix completion approach [J].
Ha, Jihwan ;
Park, Chihyun ;
Park, Sanghyun .
BMC SYSTEMS BIOLOGY, 2019, 13
[8]  
Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830
[9]   Stem cell division is regulated by the microRNA pathway [J].
Hatfield, SD ;
Shcherbata, HR ;
Fischer, KA ;
Nakahara, K ;
Carthew, RW ;
Ruohola-Baker, H .
NATURE, 2005, 435 (7044) :974-978
[10]   A polycistronic microRNA cluster, miR-17-92, is overexpressed in human lung cancers and enhances cell proliferation [J].
Hayashita, Y ;
Osada, H ;
Tatematsu, Y ;
Yamada, H ;
Yanagisawa, K ;
Tomida, S ;
Yatabe, Y ;
Kawahara, K ;
Sekido, Y ;
Takahashi, T .
CANCER RESEARCH, 2005, 65 (21) :9628-9632