LMTRDA: Using logistic model tree to predict MiRNA-disease associations by fusing multi-source information of sequences and similarities

被引:95
|
作者
Wang, Lei [1 ]
You, Zhu-Hong [1 ]
Chen, Xing [2 ]
Li, Yang-Ming [3 ]
Dong, Ya-Nan [4 ]
Li, Li-Ping [1 ]
Zheng, Kai [1 ]
机构
[1] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi, Peoples R China
[2] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou, Jiangsu, Peoples R China
[3] Rochester Inst Technol, Dept Elect Comp & Telecommun Engn Technol, Rochester, NY 14623 USA
[4] Cent South Univ, Xiangya Sch Publ Hlth, Changsha, Hunan, Peoples R China
基金
美国国家科学基金会;
关键词
PROTEIN-PROTEIN INTERACTIONS; MICRORNAS; IDENTIFICATION; NETWORK;
D O I
10.1371/journal.pcbi.1006865
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Emerging evidence has shown microRNAs (miRNAs) play an important role in human disease research. Identifying potential association among them is significant for the development of pathology, diagnose and therapy. However, only a tiny portion of all miRNA-disease pairs in the current datasets are experimentally validated. This prompts the development of high-precision computational methods to predict real interaction pairs. In this paper, we propose a new model of Logistic Model Tree for predicting miRNA-Disease Association (LMTRDA) by fusing multi-source information including miRNA sequences, miRNA functional similarity, disease semantic similarity, and known miRNA-disease associations. In particular, we introduce miRNA sequence information and extract its features using natural language processing technique for the first time in the miRNA-disease prediction model. In the cross-validation experiment, LMTRDA obtained 90.51% prediction accuracy with 92.55% sensitivity at the AUC of 90.54% on the HMDD V3.0 dataset. To further evaluate the performance of LMTRDA, we compared it with different classifier and feature descriptor models. In addition, we also validate the predictive ability of LMTRDA in human diseases including Breast Neoplasms, Breast Neoplasms and Lymphoma. As a result, 28, 27 and 26 out of the top 30 miRNAs associated with these diseases were verified by experiments in different kinds of case studies. These experimental results demonstrate that LMTRDA is a reliable model for predicting the association among miRNAs and diseases. Author summary Identification of miRNA-disease associations is considered as an important step for the development of diagnose and therapy. Computational methods contribute to discovering the potential disease-related miRNAs. Based on the assumption that functionally related miRNAs tend to be involved disease, the model of LMTRDA is proposed to prioritize the underlying miRNA-disease associations by fusing multi-source information including miRNA sequences, miRNA functional similarity, disease semantic similarity, and known miRNA-disease associations. Through cross validation, the promising results demonstrated the effectiveness of the proposed model. We further implemented the case studies of three important human complex diseases including Breast Neoplasms, Breast Neoplasms and Lymphoma, 28, 27 and 26 of top-30 predicted miRNA-disease associations have been manually confirmed based on recent experimental reports. It is anticipated that LMTRDA model could prioritize the most potential miRNA-disease associations on a large scale for advancing the progress of biological experiment validation in the future, which could further contribute to the understanding of complex disease mechanisms.
引用
收藏
页数:18
相关论文
共 19 条
  • [1] MDA-CF: Predicting MiRNA-Disease associations based on a cascade forest model by fusing multi-source information
    Dai, Qiuying
    Chu, Yanyi
    Li, Zhiqi
    Zhao, Yusong
    Mao, Xueying
    Wang, Yanjing
    Xiong, Yi
    Wei, Dong-Qing
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 136
  • [2] Predicting miRNA-disease associations based on graph attention network with multi-source information
    Li, Guanghui
    Fang, Tao
    Zhang, Yuejin
    Liang, Cheng
    Xiao, Qiu
    Luo, Jiawei
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [3] Generative Adversarial Matrix Completion Network based on Multi-Source Data Fusion for miRNA-Disease Associations Prediction
    Wang, ShuDong
    Li, YunYin
    Zhang, YuanYuan
    Pang, ShanChen
    Qiao, SiBo
    Zhang, Yu
    Wang, FuYu
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [4] A path-based measurement for human miRNA functional similarities using miRNA-disease associations
    Ding, Pingjian
    Luo, Jiawei
    Xiao, Qiu
    Chen, Xiangtao
    SCIENTIFIC REPORTS, 2016, 6
  • [5] MDformer: A transformer-based method for predicting miRNA-Disease associations using multi-source feature fusion and maximal meta-path instances encoding
    Dong, Benzhi
    Sun, Weidong
    Xu, Dali
    Wang, Guohua
    Zhang, Tianjiao
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 167
  • [6] Predicting miRNA-disease associations based on multi-view information fusion
    Xie, Xuping
    Wang, Yan
    Sheng, Nan
    Zhang, Shuangquan
    Cao, Yangkun
    Fu, Yuan
    FRONTIERS IN GENETICS, 2022, 13
  • [7] Predicting miRNA-disease associations using improved random walk with restart and integrating multiple similarities
    Van Tinh Nguyen
    Thi Tu Kien Le
    Khoat Than
    Dang Hung Tran
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [8] An Ensemble Approach Based on Multi-Source Information to Predict Drug-MiRNA Associations via Convolutional Neural Networks
    Deepthi, K.
    Jereesh, A. S.
    IEEE ACCESS, 2021, 9 (09): : 38331 - 38341
  • [9] Identification of MiRNA-Disease Associations Based on Information of Multi-Module and Meta-Path
    Li, Zihao
    Huang, Xing
    Shi, Yakun
    Zou, Xiaoyong
    Li, Zhanchao
    Dai, Zong
    MOLECULES, 2022, 27 (14):
  • [10] MLRDFM: a multi-view Laplacian regularized DeepFM model for predicting miRNA-disease associations
    Ding, Yulian
    Lei, Xiujuan
    Liao, Bo
    Wu, Fang-Xiang
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (03)