HOPEXGB: A Consensual Model for Predicting miRNA/lncRNA-Disease Associations Using a Heterogeneous Disease-miRNA-lncRNA Information Network

Cited by: 7
Authors
He, Jian [1]
Li, Menglong [1]
Qiu, Jiangguo [1]
Pu, Xuemei [1]
Guo, Yanzhi [1]
Affiliations
[1] Sichuan Univ, Coll Chem, Chengdu 610064, Peoples R China
Keywords
LONG NONCODING RNAS; EXPRESSION; CANCER; DATABASE;
DOI
10.1021/acs.jcim.3c00856
Chinese Library Classification
R914 [Medicinal Chemistry];
Discipline classification code
100701;
Abstract
Predicting disease-related microRNAs (miRNAs) and long noncoding RNAs (lncRNAs) is crucial to find new biomarkers for the prevention, diagnosis, and treatment of complex human diseases. Computational predictions for miRNA/lncRNA-disease associations are of great practical significance, since traditional experimental detection is expensive and time-consuming. In this paper, we proposed a consensual machine-learning technique-based prediction approach to identify disease-related miRNAs and lncRNAs by high-order proximity preserved embedding (HOPE) and eXtreme Gradient Boosting (XGB), named HOPEXGB. By connecting lncRNA, miRNA, and disease nodes based on their correlations and relationships, we first created a heterogeneous disease-miRNA-lncRNA (DML) information network to achieve an effective fusion of information on similarities, correlations, and interactions among miRNAs, lncRNAs, and diseases. In addition, a more rational negative data set was generated based on the similarities of unknown associations with the known ones, so as to effectively reduce the false negative rate in the data set for model construction. By 10-fold cross-validation, HOPE shows better performance than other graph embedding methods. The final consensual HOPEXGB model yields robust performance with a mean prediction accuracy of 0.9569 and also demonstrates high sensitivity and specificity advantages compared to lncRNA/miRNA-specific predictions. Moreover, it is superior to other existing methods and gives promising performance on the external testing data, indicating that integrating the information on lncRNA-miRNA interactions and the similarities of lncRNAs/miRNAs is beneficial for improving the prediction performance of the model. Finally, case studies on lung, stomach, and breast cancers indicate that HOPEXGB could be a powerful tool for preclinical biomarker detection and bioexperiment preliminary screening for the diagnosis and prognosis of cancers. HOPEXGB is publicly available at https://github.com/airpamper/HOPEXGB.
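The embedding step named in the abstract can be illustrated with a minimal sketch. It assumes the commonly used Katz-proximity form of HOPE, factorized by a truncated SVD, on a made-up six-node network (two diseases, two miRNAs, two lncRNAs). The function name `hope_embedding`, the toy edges, the value of `beta`, and the concatenation used to build a pair feature are all illustrative assumptions and are not taken from the HOPEXGB repository; in the published model such features would then be passed to an XGBoost classifier, which is omitted here.

```python
import numpy as np


def hope_embedding(adj, dim, beta):
    """Embed nodes by factorizing the Katz proximity matrix (HOPE-style sketch).

    beta must be smaller than 1 / (spectral radius of adj) for the Katz
    series to converge; the caller is responsible for choosing it.
    """
    adj = np.asarray(adj, dtype=float)
    n = adj.shape[0]
    # Katz proximity: S = (I - beta * A)^{-1} (beta * A)
    katz = np.linalg.solve(np.eye(n) - beta * adj, beta * adj)
    # Truncated SVD: S is approximated by (U sqrt(Sigma)) (V sqrt(Sigma))^T
    u, s, vt = np.linalg.svd(katz)
    root = np.sqrt(s[:dim])
    source = u[:, :dim] * root
    target = vt[:dim, :].T * root
    return source, target


# Hypothetical toy network: nodes 0-1 diseases, 2-3 miRNAs, 4-5 lncRNAs.
edges = [(0, 2), (1, 3), (0, 4), (2, 4), (3, 5), (1, 5)]
adj = np.zeros((6, 6))
for i, j in edges:
    adj[i, j] = adj[j, i] = 1.0  # undirected association or interaction

source, target = hope_embedding(adj, dim=3, beta=0.1)

# Assumed feature construction for one disease-miRNA pair (disease 0, miRNA 2):
# concatenate the two node embeddings into a single vector.
pair_features = np.concatenate([source[0], source[2]])
```

With `dim` equal to the number of nodes the product `source @ target.T` reproduces the Katz matrix exactly; smaller values of `dim` give a low-rank approximation that keeps the strongest high-order proximities.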
Pages: 2863-2877
Page count: 15