HOPEXGB: A Consensual Model for Predicting miRNA/lncRNA-Disease Associations Using a Heterogeneous Disease-miRNA-lncRNA Information Network

Cited by: 9
Authors
He, Jian [1]
Li, Menglong [1]
Qiu, Jiangguo [1]
Pu, Xuemei [1]
Guo, Yanzhi [1]
Affiliations
[1] Sichuan Univ, Coll Chem, Chengdu 610064, Peoples R China
Keywords
LONG NONCODING RNAS; EXPRESSION; CANCER; DATABASE;
DOI
10.1021/acs.jcim.3c00856
Chinese Library Classification number
R914 [Medicinal chemistry]
Discipline classification code
100701
Abstract
Predicting disease-related microRNAs (miRNAs) and long noncoding RNAs (lncRNAs) is crucial to find new biomarkers for the prevention, diagnosis, and treatment of complex human diseases. Computational predictions for miRNA/lncRNA-disease associations are of great practical significance, since traditional experimental detection is expensive and time-consuming. In this paper, we proposed a consensual machine-learning technique-based prediction approach to identify disease-related miRNAs and lncRNAs by high-order proximity preserved embedding (HOPE) and eXtreme Gradient Boosting (XGB), named HOPEXGB. By connecting lncRNA, miRNA, and disease nodes based on their correlations and relationships, we first created a heterogeneous disease-miRNA-lncRNA (DML) information network to achieve an effective fusion of information on similarities, correlations, and interactions among miRNAs, lncRNAs, and diseases. In addition, a more rational negative data set was generated based on the similarities of unknown associations with the known ones, so as to effectively reduce the false negative rate in the data set for model construction. By 10-fold cross-validation, HOPE shows better performance than other graph embedding methods. The final consensual HOPEXGB model yields robust performance with a mean prediction accuracy of 0.9569 and also demonstrates high sensitivity and specificity advantages compared to lncRNA/miRNA-specific predictions. Moreover, it is superior to other existing methods and gives promising performance on the external testing data, indicating that integrating the information on lncRNA-miRNA interactions and the similarities of lncRNAs/miRNAs is beneficial for improving the prediction performance of the model. Finally, case studies on lung, stomach, and breast cancers indicate that HOPEXGB could be a powerful tool for preclinical biomarker detection and bioexperiment preliminary screening for the diagnosis and prognosis of cancers. HOPEXGB is publicly available at https://github.com/airpamper/HOPEXGB.
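The embedding step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation (see the repository above for that). It assumes a Katz proximity matrix, which is one common choice for HOPE, and factorizes it with a dense SVD instead of the generalized SVD that HOPE uses at scale. The six-node network, the `hope_embedding` function, and the `beta_scale` parameter are all hypothetical and exist only for illustration. The XGB classification stage is omitted.

```python
import numpy as np


def hope_embedding(adj, dim, beta_scale=0.5):
    """Toy HOPE-style embedding: Katz proximity followed by a dense SVD.

    Assumption: dense linear algebra is acceptable, which only holds for
    very small graphs. Returns (embedding, katz_proximity).
    """
    adj = np.asarray(adj, dtype=float)
    n = adj.shape[0]
    # The Katz series converges only when beta < 1 / spectral_radius(adj).
    spectral_radius = np.max(np.abs(np.linalg.eigvals(adj)))
    beta = beta_scale / spectral_radius
    # Katz proximity: S = (I - beta * A)^(-1) (beta * A)
    katz = np.linalg.solve(np.eye(n) - beta * adj, beta * adj)
    u, s, vt = np.linalg.svd(katz)
    half = dim // 2
    sqrt_s = np.sqrt(s[:half])
    source = u[:, :half] * sqrt_s      # source-role vectors
    target = vt[:half].T * sqrt_s      # target-role vectors
    return np.hstack([source, target]), katz


# Hypothetical network: nodes 0-1 diseases, 2-3 miRNAs, 4-5 lncRNAs.
edges = [(0, 2), (0, 4), (1, 3), (1, 5), (2, 4), (3, 5), (2, 3)]
adj = np.zeros((6, 6))
for i, j in edges:
    adj[i, j] = adj[j, i] = 1.0

emb, katz = hope_embedding(adj, dim=4)

# One possible feature vector for the candidate pair (miRNA 3, disease 0):
# the concatenation of the two node embeddings.
pair_features = np.concatenate([emb[3], emb[0]])
print(emb.shape, pair_features.shape)
```

In a pipeline of the kind the abstract describes, pair feature vectors like `pair_features` would be passed to a gradient-boosting classifier together with known-association labels. How the authors actually build pair features is not stated in this record, so the concatenation here is an assumption.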
Pages: 2863-2877
Page count: 15