A pathway-based computational framework for identification of a new modal of multi-omics biomarkers and its application in esophageal cancer

被引:1
|
作者
Zhou, Qi [1 ]
Ye, Weicai [2 ,3 ]
Yu, Xiaolan [1 ,4 ]
Bao, Yun-Juan [1 ]
机构
[1] Hubei Univ, Sch Life Sci, State Key Lab Biocatalysis & Enzyme Engn, Wuhan, Peoples R China
[2] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangdong Prov Key Lab Computat Sci, Guangzhou, Peoples R China
[3] Sun Yat Sen Univ, Natl Engn Lab Big Data Anal & Applicat, Guangzhou, Peoples R China
[4] Hubei Jiangxia Lab, Wuhan, Peoples R China
关键词
Multi-omics biomarkers; Machine learning; Pathway; Esophageal carcinoma; SQUAMOUS-CELL CARCINOMA; EXPRESSION PROFILES; EARLY-DIAGNOSIS; PROGNOSIS; PACKAGE; GROWTH;
D O I
10.1016/j.cmpb.2024.108077
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background: The pathway -based strategy has been recently proposed for identifying biomarkers with the advantages of higher biological interpretability and cross -data robustness than the conventional gene -based strategy. However, its utility in clinical applications has been limited due to the high computational complexity and ill-defined performance. Objective: The current study presents a machine learning -based computational framework using multi-omics data for identifying a new modal of biomarkers, called pathway -derived core biomarkers, which have the advantages of both gene -based and pathway -based biomarkers. Methods: Machine -learning methods and gene -pathway network were integrated to select the pathway -derived core biomarkers. Multiple machine -learning algorithms were used to construct and validate the diagnostic models of the biomarkers based on more than 1400 multi-omics clinical samples of esophageal squamous cell carcinoma (ESCC). Results: The results showed that the classifier models based on the new modal biomarkers achieved superior performance in the training datasets with an average AUC/accuracy of 0.98/0.95 and 0.89/0.81 for mRNAs and miRNA, respectively, higher than the currently known classifier models based on the conventional gene -based strategy and pathway -based strategy. In the testing cohorts, the AUC/accuracy increased by 6.1 %/7.3 % than the models based on the native gene -based biomarkers. The improved performance was further confirmed in independent validation cohorts. Specifically, the sensitivity/specificity increased by -3 % and the variance significantly decreased by -69 % compared with that of the native gene -based biomarkers. Importantly, the pathway -derived core biomarkers also recovered 45 % more previously reported biomarkers than the gene -based biomarkers and are more functionally relevant to the ESCC etiology (involved in 14 versus 7 pathways related with ESCC or other cancer), highlighting the cross -data robustness of this new modal of biomarkers via enhanced functional relevance. Conclusions: The results demonstrated that the new modal of biomarkers not only have improved predicting performance and robustness, but also exhibit higher functional interpretability thus leading to the potential application in cancer diagnosis.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] PathIntegrate: Multivariate modelling approaches for pathway-based multi-omics data integration
    Wieder, Cecilia
    Cooke, Juliette
    Frainay, Clement
    Poupin, Nathalie
    Bowler, Russell
    Jourdan, Fabien
    Kechris, Katerina J.
    Lai, Rachel P. J.
    Ebbels, Timothy
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (03)
  • [2] Metabolism pathway-based subtyping in endometrial cancer: An integrated study by multi-omics analysis and machine learning algorithms
    Liu, Xiaodie
    Wang, Wenhui
    Zhang, Xiaolei
    Liang, Jing
    Feng, Dingqing
    Li, Yuebo
    Xue, Ming
    Ling, Bin
    MOLECULAR THERAPY NUCLEIC ACIDS, 2024, 35 (02):
  • [3] Subtype-MGTP: a cancer subtype identification framework based on multi-omics translation
    Xie, Minzhu
    Kuang, Yabin
    Song, Mengyun
    Bao, Ergude
    BIOINFORMATICS, 2024, 40 (06)
  • [4] Computational identification and characterization of glioma candidate biomarkers through multi-omics integrative profiling
    Liu, Lin
    Wang, Guangyu
    Wang, Liguo
    Yu, Chunlei
    Li, Mengwei
    Song, Shuhui
    Hao, Lili
    Ma, Lina
    Zhang, Zhang
    BIOLOGY DIRECT, 2020, 15 (01)
  • [5] Computational identification and characterization of glioma candidate biomarkers through multi-omics integrative profiling
    Lin Liu
    Guangyu Wang
    Liguo Wang
    Chunlei Yu
    Mengwei Li
    Shuhui Song
    Lili Hao
    Lina Ma
    Zhang Zhang
    Biology Direct, 15
  • [6] Editorial: Identification of immune-related biomarkers for cancer diagnosis based on multi-omics data
    Cheng, Liang
    Zhang, Xin
    Li, Chuan-Xin
    Guo, Rui
    Zhao, Tianyi
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [7] Gene- and Pathway-Based Deep Neural Network for Multi-omics Data Integration to Predict Cancer Survival Outcomes
    Hao, Jie
    Masum, Mohammad
    Oh, Jung Hun
    Kang, Mingon
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2019, 2019, 11490 : 113 - 124
  • [8] Identification of novel prognostic biomarkers by integrating multi-omics data in gastric cancer
    Nannan Liu
    Yun Wu
    Weipeng Cheng
    Yuxuan Wu
    Liguo Wang
    Liwei Zhuang
    BMC Cancer, 21
  • [9] Identification of novel prognostic biomarkers by integrating multi-omics data in gastric cancer
    Liu, Nannan
    Wu, Yun
    Cheng, Weipeng
    Wu, Yuxuan
    Wang, Liguo
    Zhuang, Liwei
    BMC CANCER, 2021, 21 (01)
  • [10] Potential Biomarkers for Liver Cancer Diagnosis Based on Multi-Omics Strategy
    Chen, Fanghua
    Wang, Junming
    Wu, Yingcheng
    Gao, Qiang
    Zhang, Shu
    FRONTIERS IN ONCOLOGY, 2022, 12