HOPEXGB: A Consensual Model for Predicting miRNA/lncRNA-Disease Associations Using a Heterogeneous Disease-miRNA-lncRNA Information Network

被引:9
作者
He, Jian [1 ]
Li, Menglong [1 ]
Qiu, Jiangguo [1 ]
Pu, Xuemei [1 ]
Guo, Yanzhi [1 ]
机构
[1] Sichuan Univ, Coll Chem, Chengdu 610064, Peoples R China
关键词
LONG NONCODING RNAS; EXPRESSION; CANCER; DATABASE;
D O I
10.1021/acs.jcim.3c00856
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Predicting disease-related microRNAs(miRNAs) and longnoncodingRNAs (lncRNAs) is crucial to find new biomarkers for the prevention,diagnosis, and treatment of complex human diseases. Computationalpredictions for miRNA/lncRNA-disease associations are of great practicalsignificance, since traditional experimental detection is expensiveand time-consuming. In this paper, we proposed a consensual machine-learningtechnique-based prediction approach to identify disease-related miRNAsand lncRNAs by high-order proximity preserved embedding (HOPE) andeXtreme Gradient Boosting (XGB), named HOPEXGB. By connecting lncRNA,miRNA, and disease nodes based on their correlations and relationships,we first created a heterogeneous disease-miRNA-lncRNA (DML) informationnetwork to achieve an effective fusion of information on similarities,correlations, and interactions among miRNAs, lncRNAs, and diseases.In addition, a more rational negative data set was generated basedon the similarities of unknown associations with the known ones, soas to effectively reduce the false negative rate in the data set formodel construction. By 10-fold cross-validation, HOPE shows betterperformance than other graph embedding methods. The final consensualHOPEXGB model yields robust performance with a mean prediction accuracyof 0.9569 and also demonstrates high sensitivity and specificity advantagescompared to lncRNA/miRNA-specific predictions. Moreover, it is superiorto other existing methods and gives promising performance on the externaltesting data, indicating that integrating the information on lncRNA-miRNAinteractions and the similarities of lncRNAs/miRNAs is beneficialfor improving the prediction performance of the model. Finally, casestudies on lung, stomach, and breast cancers indicate that HOPEXGBcould be a powerful tool for preclinical biomarker detection and bioexperimentpreliminary screening for the diagnosis and prognosis of cancers.HOPEXGB is publicly available at https://github.com/airpamper/HOPEXGB.
引用
收藏
页码:2863 / 2877
页数:15
相关论文
共 50 条
[31]   Inferring novel lncRNA-disease associations based on a random walk model of a lncRNA functional similarity network [J].
Sun, Jie ;
Shi, Hongbo ;
Wang, Zhenzhen ;
Zhang, Changjian ;
Liu, Lin ;
Wang, Letian ;
He, Weiwei ;
Hao, Dapeng ;
Liu, Shulin ;
Zhou, Meng .
MOLECULAR BIOSYSTEMS, 2014, 10 (08) :2074-2081
[32]   Predicting LncRNA-Disease Association by a Random Walk With Restart on Multiplex and Heterogeneous Networks [J].
Yao, Yuhua ;
Ji, Binbin ;
Lv, Yaping ;
Li, Ling ;
Xiang, Ju ;
Liao, Bo ;
Gao, Wei .
FRONTIERS IN GENETICS, 2021, 12
[33]   LncRNA-disease association prediction based on neighborhood information aggregation in neural network [J].
Chen, Hongjie ;
Zhang, Xuan ;
Song, Tao ;
Wang, Xun ;
Zeng, Xiangxiang ;
Rodriguez-Paton, Alfonso .
PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, :175-178
[34]   LncRNA-Disease Associations Prediction Using Bipartite Local Model With Nearest Profile-Based Association Inferring [J].
Cui, Zhen ;
Liu, Jin-Xing ;
Gao, Ying-Lian ;
Zhu, Rong ;
Yuan, Sha-Sha .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (05) :1519-1527
[35]   Predicting lncRNA-disease Association based on Extreme Gradient Boosting [J].
Tang, Xi ;
Li, Menglu ;
Zhang, Wei ;
Xia, Junfeng .
PROCEEDINGS OF 2020 10TH INTERNATIONAL CONFERENCE ON BIOSCIENCE, BIOCHEMISTRY AND BIOINFORMATICS (ICBBB 2020), 2020, :69-73
[36]   Dual Attention Mechanisms and Feature Fusion Networks Based Method for Predicting LncRNA-Disease Associations [J].
Liu, Yu ;
Yu, Yingying ;
Zhao, Shimin .
INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2022, 14 (02) :358-371
[37]   Laplacian normalization and bi-random walks on heterogeneous networks for predicting lncRNA-disease associations [J].
Wen, Yaping ;
Han, Guosheng ;
Anh, Vo V. .
BMC SYSTEMS BIOLOGY, 2018, 12
[38]   Predicting LncRNA-Disease Association Based on Generative Adversarial Network [J].
Du, Biao ;
Tang, Lin ;
Liu, Lin ;
Zhou, Wei .
CURRENT GENE THERAPY, 2022, 22 (02) :144-151
[39]   HAUBRW: Hybrid algorithm and unbalanced bi-random walk for predicting lncRNA-disease associations [J].
Xie, Guobo ;
Wu, Changhai ;
Gu, Guosheng ;
Huang, Bin .
GENOMICS, 2020, 112 (06) :4777-4787
[40]   Predicting lncRNA-disease associations using multiple metapaths in hierarchical graph attention networks [J].
Yao, Dengju ;
Deng, Yuexiao ;
Zhan, Xiaojuan ;
Zhan, Xiaorong .
BMC BIOINFORMATICS, 2024, 25 (01)