Predicting Colorectal Cancer Survival Using Time-to-Event Machine Learning: Retrospective Cohort Study

Cited by: 7
Authors
Yang, Xulin [1]
Qiu, Hang [1,2]
Wang, Liya [2]
Wang, Xiaodong [3]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, 2006 Xiyuan Ave, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Big Data Res Ctr, Chengdu, Peoples R China
[3] Sichuan Univ, West China Hosp, Dept Gastrointestinal Surg, Chengdu, Peoples R China
Keywords
colorectal cancer; survival prediction; machine learning; time-to-event; SHAP; SHapley Additive exPlanations; DIAGNOSIS; MODELS;
DOI
10.2196/44417
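Chinese Library Classification number
R19 [Health care organizations and services (health services management)];
Discipline classification code
Abstract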
Background: Machine learning (ML) methods have shown great potential in predicting colorectal cancer (CRC) survival. However, the ML models introduced thus far have mainly focused on binary outcomes and have not accounted for the time-to-event nature of survival data.
Objective: This study aims to evaluate the performance of ML approaches for modeling time-to-event survival data and to develop transparent models for predicting CRC-specific survival.
Methods: The data set used in this retrospective cohort study contains information on patients who were newly diagnosed with CRC between December 28, 2012, and December 27, 2019, at West China Hospital, Sichuan University. We assessed the performance of 6 representative ML models in predicting CRC-specific survival: random survival forest (RSF), gradient boosting machine (GBM), DeepSurv, DeepHit, neural net-extended time-dependent Cox (Cox-Time), and neural multitask logistic regression (N-MTLR). The multiple imputation by chained equations method was applied to handle missing values. Multivariable analysis and clinical experience were used to select significant features associated with CRC survival. Model performance was evaluated in stratified 5-fold cross-validation repeated 5 times by using the time-dependent concordance index, integrated Brier score, calibration curves, and decision curves. The SHapley Additive exPlanations (SHAP) method was applied to calculate feature importance.
Results: A total of 2157 patients with CRC were included in this study. Among the 6 time-to-event ML models, the DeepHit model exhibited the best discriminative ability (time-dependent concordance index 0.789, 95% CI 0.779-0.799) and the RSF model produced the best-calibrated survival estimates (integrated Brier score 0.096, 95% CI 0.094-0.099), although these differences were not statistically significant. Additionally, the RSF, GBM, DeepSurv, Cox-Time, and N-MTLR models had predictive accuracy comparable to that of the Cox Proportional Hazards model in terms of discrimination and calibration. The calibration curves showed that all the ML models exhibited good 5-year survival calibration. The decision curves for CRC-specific survival at 5 years showed that all the ML models, especially RSF, had higher net benefits than the default strategies of treating all or no patients across a range of clinically reasonable risk thresholds. The SHAP method revealed that R0 resection, tumor-node-metastasis staging, and the number of positive lymph nodes were important factors for 5-year CRC-specific survival.
Conclusions: This study showed the potential of applying time-to-event ML predictive algorithms to help predict CRC-specific survival. The RSF, GBM, Cox-Time, and N-MTLR algorithms could provide nonparametric alternatives to the Cox Proportional Hazards model in estimating the survival probability of patients with CRC. The transparent time-to-event ML models help clinicians to predict survival more accurately for these patients and to improve patient outcomes by enabling personalized treatment plans that are informed by explainable ML models.
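The discrimination metric named in the abstract can be illustrated with a small sketch. The study reports a time-dependent concordance index; the code below instead implements the simpler Harrell concordance index for right-censored data, which rests on the same idea (among comparable patient pairs, the one who died earlier should have the higher predicted risk). The function name `harrell_c_index` and the toy data are illustrative assumptions and are not taken from the paper.

```python
from itertools import combinations


def harrell_c_index(times, events, risk_scores):
    """Harrell's concordance index for right-censored survival data.

    times       -- observed follow-up time per patient
    events      -- 1 if the event (death) was observed, 0 if censored
    risk_scores -- model output; a higher score means higher predicted risk

    Simplification (assumption of this sketch): pairs with identical
    observed times are skipped rather than handled case by case.
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        if times[i] == times[j]:
            continue
        # Order the pair so that `a` has the shorter observed time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # The pair is comparable only if the shorter time is an observed
        # event; a censored patient may have outlived the other one.
        if not events[a]:
            continue
        comparable += 1
        if risk_scores[a] > risk_scores[b]:
            concordant += 1.0  # earlier death was ranked as higher risk
        elif risk_scores[a] == risk_scores[b]:
            concordant += 0.5  # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy cohort (hypothetical values): follow-up in months, event flags, risks.
toy_times = [5, 10, 15, 20]
toy_events = [1, 0, 1, 1]
perfect = harrell_c_index(toy_times, toy_events, [0.9, 0.8, 0.3, 0.1])
one_swap = harrell_c_index(toy_times, toy_events, [0.9, 0.8, 0.1, 0.3])
```

A value of 1.0 means every comparable pair is ranked correctly and 0.5 corresponds to random ranking, so the reported 0.789 for DeepHit would sit well above chance on this scale, with the caveat that the time-dependent variant used in the study compares predicted survival probabilities at event times rather than a single risk score.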
Pages: 12
Related papers
50 in total
  • [21] Predicting early gastric cancer risk using machine learning: A population-based retrospective study
    Ke, Xing
    Cai, Xinyu
    Bian, Bingxian
    Shen, Yuanheng
    Zhou, Yunlan
    Liu, Wei
    Wang, Xu
    Shen, Lisong
    Yang, Junyao
    DIGITAL HEALTH, 2024, 10
  • [22] Using median survival in meta-analysis of experimental time-to-event data
    Hirst, Theodore C.
    Sena, Emily S.
    Macleod, Malcolm R.
    SYSTEMATIC REVIEWS, 2021, 10 (01)
  • [23] Learning the Treatment Impact on Time-to-Event Outcomes: The Transcarotid Artery Revascularization Simulated Cohort
    Martinez-Camblor, Pablo
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (19)
  • [24] Colorectal Cancer Presentation and Survival in Young Individuals: A Retrospective Cohort Study
    Ulanja, Mark B.
    Beutler, Bryce D.
    Rishi, Mohit
    Ogala, Chioma
    Patterson, Darryll R.
    Gullapalli, Nageshwara
    Ambika, Santhosh
    CANCERS, 2018, 10 (12)
  • [25] Machine learning–based survival models for predicting rehospitalization of older hip fracture patients: a retrospective cohort study
    Juhan Oh
    Minah Park
    Yonghan Cha
    Jae-Hyun Kim
    Seung Hoon Kim
    BMC Musculoskeletal Disorders, 26 (1)
  • [26] Predicting complications after laparoscopic surgery for ureteropelvic junction obstruction using machine learning models: a retrospective cohort study
    Zhang, Xintao
    Sun, Dong
    Zhou, Yu
    Xu, Qiongqian
    Ren, Xue
    Han, Jichang
    Ma, Chuncan
    Ma, Guohua
    Sun, Zhihao
    Jia, Yu
    Zhou, Zhihang
    Liu, Xiaoyang
    Zhang, Qiangye
    Li, Aiwu
    WORLD JOURNAL OF UROLOGY, 2025, 43 (01)
  • [27] Comparison of time-to-event machine learning models in predicting biliary complication and mortality rate in liver transplant patients
    Andishgar, Aref
    Bazmi, Sina
    Lankarani, Kamran B.
    Taghavi, Seyed Alireza
    Imanieh, Mohammad Hadi
    Sivandzadeh, Gholamreza
    Saeian, Samira
    Dadashpour, Nazanin
    Shamsaeefar, Alireza
    Ravankhah, Mahdi
    Deylami, Hamed Nikoupour
    Tabrizi, Reza
    Imanieh, Mohammad Hossein
    SCIENTIFIC REPORTS, 2025, 15 (01)
  • [28] The association of time between diagnosis and major resection with poorer colorectal cancer survival: a retrospective cohort study
    Redaniel, Maria Theresa
    Martin, Richard M.
    Blazeby, Jane M.
    Wade, Julia
    Jeffreys, Mona
    BMC CANCER, 2014, 14
  • [29] The association of time between diagnosis and major resection with poorer colorectal cancer survival: a retrospective cohort study
    Maria Theresa Redaniel
    Richard M Martin
    Jane M Blazeby
    Julia Wade
    Mona Jeffreys
    BMC Cancer, 14
  • [30] Explainable machine learning for predicting lung metastasis of colorectal cancer
    Zhentian Guo
    Zongming Zhang
    Limin Liu
    Yue Zhao
    Zhuo Liu
    Chong Zhang
    Hui Qi
    Jinqiu Feng
    Peijie Yao
    Scientific Reports, 15 (1)