A pathway-based computational framework for identification of a new modal of multi-omics biomarkers and its application in esophageal cancer

被引:1
|
作者
Zhou, Qi [1 ]
Ye, Weicai [2 ,3 ]
Yu, Xiaolan [1 ,4 ]
Bao, Yun-Juan [1 ]
机构
[1] Hubei Univ, Sch Life Sci, State Key Lab Biocatalysis & Enzyme Engn, Wuhan, Peoples R China
[2] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangdong Prov Key Lab Computat Sci, Guangzhou, Peoples R China
[3] Sun Yat Sen Univ, Natl Engn Lab Big Data Anal & Applicat, Guangzhou, Peoples R China
[4] Hubei Jiangxia Lab, Wuhan, Peoples R China
关键词
Multi-omics biomarkers; Machine learning; Pathway; Esophageal carcinoma; SQUAMOUS-CELL CARCINOMA; EXPRESSION PROFILES; EARLY-DIAGNOSIS; PROGNOSIS; PACKAGE; GROWTH;
D O I
10.1016/j.cmpb.2024.108077
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background: The pathway -based strategy has been recently proposed for identifying biomarkers with the advantages of higher biological interpretability and cross -data robustness than the conventional gene -based strategy. However, its utility in clinical applications has been limited due to the high computational complexity and ill-defined performance. Objective: The current study presents a machine learning -based computational framework using multi-omics data for identifying a new modal of biomarkers, called pathway -derived core biomarkers, which have the advantages of both gene -based and pathway -based biomarkers. Methods: Machine -learning methods and gene -pathway network were integrated to select the pathway -derived core biomarkers. Multiple machine -learning algorithms were used to construct and validate the diagnostic models of the biomarkers based on more than 1400 multi-omics clinical samples of esophageal squamous cell carcinoma (ESCC). Results: The results showed that the classifier models based on the new modal biomarkers achieved superior performance in the training datasets with an average AUC/accuracy of 0.98/0.95 and 0.89/0.81 for mRNAs and miRNA, respectively, higher than the currently known classifier models based on the conventional gene -based strategy and pathway -based strategy. In the testing cohorts, the AUC/accuracy increased by 6.1 %/7.3 % than the models based on the native gene -based biomarkers. The improved performance was further confirmed in independent validation cohorts. Specifically, the sensitivity/specificity increased by -3 % and the variance significantly decreased by -69 % compared with that of the native gene -based biomarkers. Importantly, the pathway -derived core biomarkers also recovered 45 % more previously reported biomarkers than the gene -based biomarkers and are more functionally relevant to the ESCC etiology (involved in 14 versus 7 pathways related with ESCC or other cancer), highlighting the cross -data robustness of this new modal of biomarkers via enhanced functional relevance. Conclusions: The results demonstrated that the new modal of biomarkers not only have improved predicting performance and robustness, but also exhibit higher functional interpretability thus leading to the potential application in cancer diagnosis.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] PathME: pathway based multi-modal sparse autoencoders for clustering of patient-level multi-omics data
    Lemsara, Amina
    Ouadfel, Salima
    Froehlich, Holger
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [22] PathME: pathway based multi-modal sparse autoencoders for clustering of patient-level multi-omics data
    Amina Lemsara
    Salima Ouadfel
    Holger Fröhlich
    BMC Bioinformatics, 21
  • [23] Machine learning algorithms and biomarkers identification for pancreatic cancer diagnosis using multi-omics data integration
    Rouzbahani, Arian Karimi
    Khalili-Tanha, Ghazaleh
    Rajabloo, Yasamin
    Khojasteh-Leylakoohi, Fatemeh
    Garjan, Hassan Shokri
    Nazari, Elham
    Avan, Amir
    PATHOLOGY RESEARCH AND PRACTICE, 2024, 263
  • [24] DeepOmix: A scalable and interpretable multi-omics deep learning framework and application in cancer survival analysis
    Zhao, Lianhe
    Dong, Qiongye
    Luo, Chunlong
    Wu, Yang
    Bu, Dechao
    Qi, Xiaoning
    Luo, Yufan
    Zhao, Yi
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 2719 - 2725
  • [25] MultiGATAE: A Novel Cancer Subtype Identification Method Based on Multi-Omics and Attention Mechanism
    Zhang, Ge
    Peng, Zhen
    Yan, Chaokun
    Wang, Jianlin
    Luo, Junwei
    Luo, Huimin
    FRONTIERS IN GENETICS, 2022, 13
  • [26] Identification of Prognostic Dosage-Sensitive Genes in Colorectal Cancer Based on Multi-Omics
    Chang, Zhiqiang
    Miao, Xiuxiu
    Zhao, Wenyuan
    FRONTIERS IN GENETICS, 2020, 10
  • [27] Identification of Pathway-Based Biomarkers with Crosstalk Analysis for Overall Survival Risk Prediction in Breast Cancer
    Liu, Xiaohua
    Su, Lili
    Li, Jingcong
    Ou, Guoping
    FRONTIERS IN GENETICS, 2021, 12
  • [28] A Deep Learning Fusion Clustering framework for breast cancer subtypes identification by integrating multi-omics data
    Liu Shuangshuang
    Qi Lin
    Tie Yun
    Liu Fenghui
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1710 - 1714
  • [29] A Detailed Catalogue of Multi-Omics Methodologies for Identification of Putative Biomarkers and Causal Molecular Networks in Translational Cancer Research
    Vlachavas, Efstathios Iason
    Bohn, Jonas
    Ueckert, Frank
    Nuernberg, Sylvia
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (06) : 1 - 41
  • [30] Multi-omics characterization of macrophage polarization-related features in osteoarthritis based on a machine learning computational framework
    Hu, Ping
    Li, Beining
    Yin, Zhenyu
    Peng, Peng
    Cao, Jiangang
    Xie, Wanyu
    Liu, Liang
    Cao, Fujiang
    Zhang, Bin
    HELIYON, 2024, 10 (09)