Predicting Colorectal Cancer Survival Using Time-to-Event Machine Learning: Retrospective Cohort Study

Cited by: 7
Authors
Yang, Xulin [1]
Qiu, Hang [1,2]
Wang, Liya [2]
Wang, Xiaodong [3]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, 2006 Xiyuan Ave, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Big Data Res Ctr, Chengdu, Peoples R China
[3] Sichuan Univ, West China Hosp, Dept Gastrointestinal Surg, Chengdu, Peoples R China
Keywords
colorectal cancer; survival prediction; machine learning; time-to-event; SHAP; SHapley Additive exPlanations; DIAGNOSIS; MODELS
DOI
10.2196/44417
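Chinese Library Classification number
R19 [Health care organization and services (health services administration)];
Discipline classification code
Abstract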
Background: Machine learning (ML) methods have shown great potential in predicting colorectal cancer (CRC) survival. However, the ML models introduced thus far have mainly focused on binary outcomes and have not accounted for the time-to-event nature of survival data.

Objective: This study aims to evaluate the performance of ML approaches for modeling time-to-event survival data and to develop transparent models for predicting CRC-specific survival.

Methods: The data set used in this retrospective cohort study contains information on patients who were newly diagnosed with CRC between December 28, 2012, and December 27, 2019, at West China Hospital, Sichuan University. We assessed the performance of 6 representative ML models in predicting CRC-specific survival: random survival forest (RSF), gradient boosting machine (GBM), DeepSurv, DeepHit, neural net-extended time-dependent Cox (Cox-Time), and neural multitask logistic regression (N-MTLR). Multiple imputation by chained equations was applied to handle missing values. Multivariable analysis and clinical experience were used to select significant features associated with CRC survival. Model performance was evaluated in stratified 5-fold cross-validation repeated 5 times by using the time-dependent concordance index, integrated Brier score, calibration curves, and decision curves. The SHapley Additive exPlanations (SHAP) method was applied to calculate feature importance.

Results: A total of 2157 patients with CRC were included in this study. Among the 6 time-to-event ML models, the DeepHit model exhibited the best discriminative ability (time-dependent concordance index 0.789, 95% CI 0.779-0.799) and the RSF model produced the best-calibrated survival estimates (integrated Brier score 0.096, 95% CI 0.094-0.099), but these differences were not statistically significant. Additionally, the RSF, GBM, DeepSurv, Cox-Time, and N-MTLR models had predictive accuracy comparable to that of the Cox Proportional Hazards model in terms of discrimination and calibration. The calibration curves showed that all the ML models exhibited good 5-year survival calibration. The decision curves for CRC-specific survival at 5 years showed that all the ML models, especially RSF, had higher net benefits than the default strategies of treating all or no patients across a range of clinically reasonable risk thresholds. The SHAP method revealed that R0 resection, tumor-node-metastasis staging, and the number of positive lymph nodes were important factors for 5-year CRC-specific survival.

Conclusions: This study showed the potential of applying time-to-event ML predictive algorithms to help predict CRC-specific survival. The RSF, GBM, Cox-Time, and N-MTLR algorithms could provide nonparametric alternatives to the Cox Proportional Hazards model in estimating the survival probability of patients with CRC. Transparent time-to-event ML models may help clinicians predict survival for these patients more accurately and could improve patient outcomes by supporting personalized treatment plans informed by explainable ML models.
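To illustrate how discrimination can be measured when some patients are censored, the sketch below computes Harrell's concordance index in plain Python. This is a simplification and an assumption on our part: the study reports the time-dependent concordance index, which is a related but different measure, and the abstract does not describe its implementation. The function name `harrell_c_index` and the toy data are hypothetical and are not taken from the study.

```python
from typing import Sequence


def harrell_c_index(
    times: Sequence[float],
    events: Sequence[int],
    risk_scores: Sequence[float],
) -> float:
    """Harrell's concordance index for right-censored survival data.

    times:       observed follow-up time per patient
    events:      1 if the event (e.g. CRC-specific death) was observed,
                 0 if the patient was censored
    risk_scores: model output where a higher score means higher predicted risk
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored patient cannot anchor a comparable pair
        for j in range(n):
            # A pair is comparable when patient i had the event strictly
            # before patient j's observed time (j may be censored or not).
            if times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5  # ties in predicted risk count as half
    if comparable == 0:
        raise ValueError("no comparable pairs; the C-index is undefined")
    return concordant / comparable


# Toy data, made up for illustration only (not from the study).
toy_times = [5.0, 8.0, 12.0, 20.0, 24.0]
toy_events = [1, 1, 0, 1, 0]
toy_risk = [0.9, 0.4, 0.6, 0.3, 0.1]
c_index = harrell_c_index(toy_times, toy_events, toy_risk)
print(c_index)  # 7 of 8 comparable pairs are concordant -> 0.875
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering of risk. In practice, an established survival analysis library would normally be used instead of a hand-written double loop.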
Pages: 12