Explainable machine learning model identified potential biomarkers in liver cancer survival prediction

Cited by: 1
Authors
Pan, Qi [1]
Hounye, Alphonse Houssou [1]
Miao, Kexin [1]
Su, Liuyan [1]
Wang, Jiaoju [1]
Hou, Muzhou [1]
Xiong, Li [2, 3]
Affiliations
[1] Cent South Univ, Sch Math & Stat, Changsha 410083, Peoples R China
[2] Cent South Univ, Xiangya Hosp 2, Dept Gen Surg, Changsha 410011, Peoples R China
[3] Hunan Clin Res Ctr Intelligent Gen Surg, Changsha 410011, Peoples R China
Keywords
Random Forest; XGBoost; Support Vector Machine (SVM); SHAP; Immunogenic Cell Death (ICD); Prognostic model; CEP55
DOI
10.1016/j.bspc.2024.106504
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Objective: Liver cancer is a malignant tumor with a high incidence; common treatments include surgical resection, ablation, arterial catheterization, and liver transplantation. Improving the clinical evaluation and therapeutic management of liver hepatocellular carcinoma (LIHC) is crucial, and when machine learning methods are incorporated into decision-making procedures, the interpretability of the models must be considered. In this study, the SHapley Additive exPlanation (SHAP) technique was applied to interpret a gradient-boosting decision tree (XGBoost) survival model built on The Cancer Genome Atlas (TCGA) data, with the aim of identifying potential biomarkers for liver cancer survival prediction.
Methods: Expression data and clinical information for liver cancer samples were obtained from the TCGA database, and Immunogenic Cell Death (ICD)-related genes were retrieved from the literature. Genes were screened using bioinformatics and machine learning methods. The screened differentially expressed genes (DEGs) and ICD-related genes were combined to construct the SurvMLSHAP model, and the SurvMLSHAP score was calculated. Three methods (Bayesian optimization, random search, and a genetic algorithm) were used for parameter optimization. Eight machine learning models were built to evaluate the superiority of the proposed model and to select the best model.
Results: The SurvMLSHAP model output was interpreted using the XGBoost-based SHAP method to assess the influence and significance of each feature. Tests on both synthetic and medical data validate the capability of SurvMLSHAP to identify factors that have a time-dependent impact. The C-index values on the raw data and the validation data were 0.6844 and 0.8167, respectively. Furthermore, the aggregation of SurvMLSHAP yields a more accurate assessment of variable relevance for prediction than other existing approaches. The features contributing to the XGBoost model were, in order, CEP55, PPIA, TTC36, and HSP90AA1, which could be used as predictors to assess the LIHC cohort, while the putative molecular subgroups could provide new ideas for individualized treatment of LIHC.
Conclusion: In this study, a prognostic risk model called SurvMLSHAP was constructed using bioinformatics and machine learning methods, and ICD-related biomarkers were screened to assess the prognosis of LIHC patients, which can inform personalized treatment for patients in the clinic.
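The abstract reports C-index values but does not say how they were computed. As a minimal sketch, assuming the common Harrell definition (the share of comparable patient pairs in which the patient with the shorter survival time has the higher predicted risk), the metric could be computed as follows. The function name `concordance_index` and the toy inputs are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def concordance_index(times, events, risk_scores):
    """Harrell's C-index (assumed variant; the abstract does not specify one).

    times       : observed follow-up times
    events      : 1 if the event (death) was observed, 0 if censored
    risk_scores : model output, where a higher score means higher predicted risk
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so that `a` is the subject with the shorter time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # A pair is comparable only if the times differ and the
        # earlier subject actually experienced the event.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risk_scores[a] > risk_scores[b]:
            concordant += 1.0   # correct ordering
        elif risk_scores[a] == risk_scores[b]:
            concordant += 0.5   # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy data (hypothetical): three patients, the second one censored.
toy_c = concordance_index([2, 4, 6], [1, 0, 1], [0.6, 0.8, 0.5])
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering, which is the scale on which the reported 0.6844 and 0.8167 should be read.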
Pages: 17