DeepKEGG: a multi-omics data integration framework with biological insights for cancer recurrence prediction and biomarker discovery

Cited by: 15
Authors
Lan, Wei [1]
Liao, Haibo [2]
Chen, Qingfeng [3]
Zhu, Lingzhi [4]
Pan, Yi [5]
Chen, Yi-Ping Phoebe [6]
Affiliations
[1] Guangxi Univ, Sch Comp Elect & Informat, Nanning, Peoples R China
[2] Guangxi Univ, Comp Technol, Nanning, Peoples R China
[3] Guangxi Univ, State Key Lab Conservat & Utilizat Subtrop Agrobio, Nanning, Peoples R China
[4] Hunan Inst Technol, Sch Comp & Informat Sci, Hengyang 421002, Peoples R China
[5] Chinese Acad Sci, Shenzhen Inst Adv Technol, Sch Comp Sci & Control Engn, Shenzhen, Peoples R China
[6] La Trobe Univ, Dept Comp Sci & Informat Technol, Bundoora, Vic, Australia
Funding
National Natural Science Foundation of China
Keywords
cancer recurrence prediction; interpretability of deep learning; self-attention mechanism; multi-omics data integration; HEPATOCELLULAR-CARCINOMA; BLADDER-CANCER; SIGNALING PATHWAY; PROLIFERATION; ACTIVATION; SURVIVAL;
DOI
10.1093/bib/bbae185
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Deep learning-based multi-omics data integration methods have the capability to reveal the mechanisms of cancer development, discover cancer biomarkers and identify pathogenic targets. However, current methods ignore the potential correlations between samples in integrating multi-omics data. In addition, providing accurate biological explanations still poses significant challenges due to the complexity of deep learning models. Therefore, there is an urgent need for a deep learning-based multi-omics integration method to explore the potential correlations between samples and provide model interpretability. Herein, we propose a novel interpretable multi-omics data integration method (DeepKEGG) for cancer recurrence prediction and biomarker discovery. In DeepKEGG, a biological hierarchical module is designed for local connections of neuron nodes and model interpretability based on the biological relationship between genes/miRNAs and pathways. In addition, a pathway self-attention module is constructed to explore the correlation between different samples and generate the potential pathway feature representation for enhancing the prediction performance of the model. Lastly, an attribution-based feature importance calculation method is utilized to discover biomarkers related to cancer recurrence and provide a biological interpretation of the model. Experimental results demonstrate that DeepKEGG outperforms other state-of-the-art methods in 5-fold cross validation. Furthermore, case studies also indicate that DeepKEGG serves as an effective tool for biomarker discovery. The code is available at https://github.com/lanbiolab/DeepKEGG.
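The two architectural ideas named in the abstract, pathway-guided local connections and self-attention across samples, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (see the linked repository for that). The function names, the toy gene-to-pathway mask, the tanh activation and the omission of learned query/key/value projections are all assumptions made for illustration, and the attribution step is not shown.

```python
import numpy as np

def masked_pathway_layer(x, weights, mask):
    """Map gene features to pathway nodes, keeping only annotated gene-pathway links.

    x: (n_samples, n_genes), weights and mask: (n_genes, n_pathways).
    mask[g, p] is 1 if gene g is annotated to pathway p, else 0 (hypothetical toy mask).
    """
    return np.tanh(x @ (weights * mask))

def sample_self_attention(h):
    """Scaled dot-product self-attention in which every sample attends to all samples.

    Simplification: no learned query/key/value projections are used.
    """
    scores = h @ h.T / np.sqrt(h.shape[1])
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    attn = np.exp(scores)
    attn = attn / attn.sum(axis=1, keepdims=True)         # row-wise softmax
    return attn @ h, attn

rng = np.random.default_rng(0)
n_samples, n_genes, n_pathways = 6, 8, 3

x = rng.normal(size=(n_samples, n_genes))        # toy expression matrix
mask = np.zeros((n_genes, n_pathways))
mask[0:3, 0] = 1                                 # genes 0-2 belong to pathway 0
mask[3:6, 1] = 1                                 # genes 3-5 belong to pathway 1
mask[5:8, 2] = 1                                 # genes 5-7 belong to pathway 2
weights = rng.normal(size=(n_genes, n_pathways))

pathway_features = masked_pathway_layer(x, weights, mask)
attended, attn = sample_self_attention(pathway_features)
```

Because the mask zeroes every weight outside an annotated gene-pathway pair, each pathway node depends only on its member genes, which is what makes the node readable as a pathway-level feature. The attention matrix `attn` has one row per sample and expresses how strongly that sample draws on every other sample's pathway representation.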
Pages: 16