Machine learning-based prediction of survival prognosis in esophageal squamous cell carcinoma

被引:24
作者
Zhang, Kaijiong [1 ]
Ye, Bo [1 ]
Wu, Lichun [1 ]
Ni, Sujiao [1 ]
Li, Yang [3 ]
Wang, Qifeng [2 ]
Zhang, Peng [3 ]
Wang, Dongsheng [1 ]
机构
[1] Univ Elect Sci & Technol China, Affiliated Canc Hosp, Sichuan Canc Ctr, Dept Clin Lab,Sichuan Clin Res Ctr Canc,Sichuan Ca, Chengdu, Peoples R China
[2] Univ Elect Sci & Technol China, Affiliated Canc Hosp, Sichuan Canc Ctr, Dept Radiat Oncol,Sichuan Clin Res Ctr Canc,Sichua, Chengdu, Peoples R China
[3] Huazhong Univ Sci & Technol, Tongji Hosp, Tongji Med Coll, Dept Oncol, Wuhan, Peoples R China
关键词
CANCER;
D O I
10.1038/s41598-023-40780-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The current prognostic tools for esophageal squamous cell carcinoma (ESCC) lack the necessary accuracy to facilitate individualized patient management strategies. To address this issue, this study was conducted to develop a machine learning (ML) prediction model for ESCC patients' survival management. Six ML approaches, including Rpart, Elastic Net, GBM, Random Forest, GLMboost, and the machine learning-extended CoxPH method, were employed to develop risk prediction models. The model was trained on a dataset of 1954 ESCC patients with 27 clinical features and validated on a dataset of 487 ESCC patients. The discriminative performance of the models was assessed using the concordance index (C-index). The best performing model was used for risk stratification and clinical evaluation. The study found that N stage, T stage, surgical margin, tumor grade, tumor length, sex, MPV, AST, FIB, and Mg are the important feature for ESCC patients' survival. The machine learning-extended CoxPH model, Elastic Net, and Random Forest had similar performance in predicting the mortality risk of ESCC patients, and outperformed GBM, GLMboost, and Rpart. The risk scores derived from the CoxPH model effectively stratified ESCC patients into low-, intermediate-, and high-risk groups with distinctly different 3-year overall survival (OS) probabilities of 80.8%, 58.2%, and 29.5%, respectively. This risk stratification was also observed in the validation cohort. Furthermore, the risk model demonstrated greater discriminative ability and net benefit than the AJCC8th stage, suggesting its potential as a prognostic tool for predicting survival events and guiding clinical decision-making. The classical algorithm of the CoxPH method was also found to be sufficiently good for interpretive studies.
引用
收藏
页数:12
相关论文
共 36 条
  • [1] Machine Learning for Prediction of Survival Outcomes with Immune-Checkpoint Inhibitors in Urothelial Cancer
    Abuhelwa, Ahmad Y.
    Kichenadasse, Ganessan
    McKinnon, Ross A.
    Rowland, Andrew
    Hopkins, Ashley M.
    Sorich, Michael J.
    [J]. CANCERS, 2021, 13 (09)
  • [2] Clinical and Prognostic Features of Patients With Esophageal Cancer and Multiple Primary Cancers A Retrospective Single-institution Study
    Baba, Yoshifumi
    Yoshida, Naoya
    Kinoshita, Koichi
    Iwatsuki, Masaaki
    Yamashita, Yo-ichi
    Chikamoto, Akira
    Watanabe, Masayuki
    Baba, Hideo
    [J]. ANNALS OF SURGERY, 2018, 267 (03) : 478 - 483
  • [3] Breiman L, 1984, Cart J. Classification and regression trees
  • [4] Boosting with the L2 loss:: Regression and classification
    Bühlmann, P
    Yu, B
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2003, 98 (462) : 324 - 339
  • [5] Temporal Changes in Esophageal Cancer Mortality by Geographic Region: A Population-based Analysis
    Chitti, Bhargava
    Pham, Anthony
    Marcott, Stephen
    Wang, Xin
    Potters, Louis
    Wernicke, A. Gabriella
    Parashar, Bhupesh
    [J]. CUREUS JOURNAL OF MEDICAL SCIENCE, 2018, 10 (11)
  • [6] D'Ascenzo F, 2021, LANCET, V397, P199, DOI 10.1016/S0140-6736(20)32519-8
  • [7] Machine learning-based prediction of survival prognosis in cervical cancer
    Ding, Dongyan
    Lang, Tingyuan
    Zou, Dongling
    Tan, Jiawei
    Chen, Jia
    Zhou, Lei
    Wang, Dong
    Li, Rong
    Li, Yunzhe
    Liu, Jingshu
    Ma, Cui
    Zhou, Qi
    [J]. BMC BIOINFORMATICS, 2021, 22 (01)
  • [8] Regularization Paths for Generalized Linear Models via Coordinate Descent
    Friedman, Jerome
    Hastie, Trevor
    Tibshirani, Rob
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01): : 1 - 22
  • [9] Stochastic gradient boosting
    Friedman, JH
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 38 (04) : 367 - 378
  • [10] Machine Learning for Early Lung Cancer Identification Using Routine Clinical and Laboratory Data
    Gould, Michael K.
    Huang, Brian Z.
    Tammemagi, Martin C.
    Kinar, Yaron
    Shiff, Ron
    [J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2021, 204 (04) : 445 - 453