Predicting Colorectal Cancer Survival Using Time-to-Event Machine Learning: Retrospective Cohort Study

被引:7
|
作者
Yang, Xulin [1 ]
Qiu, Hang [1 ,2 ]
Wang, Liya [2 ]
Wang, Xiaodong [3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, 2006 Xiyuan Ave, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Big Data Res Ctr, Chengdu, Peoples R China
[3] Sichuan Univ, West China Hosp, Dept Gastrointestinal Surg, Chengdu, Peoples R China
关键词
colorectal cancer; survival prediction; machine learning; time-to-event; SHAP; SHapley Additive exPlanations; DIAGNOSIS; MODELS;
D O I
10.2196/44417
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Machine learning (ML) methods have shown great potential in predicting colorectal cancer (CRC) survival. However, the ML models introduced thus far have mainly focused on binary outcomes and have not considered the time-to-event nature of this type of modeling. Objective: This study aims to evaluate the performance of ML approaches for modeling time-to-event survival data and develop transparent models for predicting CRC-specific survival. Methods: The data set used in this retrospective cohort study contains information on patients who were newly diagnosed with CRC between December 28, 2012, and December 27, 2019, at West China Hospital, Sichuan University. We assessed the performance of 6 representative ML models, including random survival forest (RSF), gradient boosting machine (GBM), DeepSurv, DeepHit, neural net-extended time-dependent Cox (or Cox-Time), and neural multitask logistic regression (N-MTLR) in predicting CRC-specific survival. Multiple imputation by chained equations method was applied to handle missing values in variables. Multivariable analysis and clinical experience were used to select significant features associated with CRC survival. Model performance was evaluated in stratified 5-fold cross-validation repeated 5 times by using the time-dependent concordance index, integrated Brier score, calibration curves, and decision curves. The SHapley Additive exPlanations method was applied to calculate feature importance. Results: A total of 2157 patients with CRC were included in this study. Among the 6 time-to-event ML models, the DeepHit model exhibited the best discriminative ability (time-dependent concordance index 0.789, 95% CI 0.779-0.799) and the RSF model produced better-calibrated survival estimates (integrated Brier score 0.096, 95% CI 0.094-0.099), but these are not statistically significant. Additionally, the RSF, GBM, DeepSurv, Cox-Time, and N-MTLR models have comparable predictive accuracy to the Cox Proportional Hazards model in terms of discrimination and calibration. The calibration curves showed that all the ML models exhibited good 5-year survival calibration. The decision curves for CRC-specific survival at 5 years showed that all the ML models, especially RSF, had higher net benefits than default strategies of treating all or no patients at a range of clinically reasonable risk thresholds. The SHapley Additive exPlanations method revealed that R0 resection, tumor-node-metastasis staging, and the number of positive lymph nodes were important factors for 5-year CRC-specific survival. Conclusions: This study showed the potential of applying time-to-event ML predictive algorithms to help predict CRC-specific survival. The RSF, GBM, Cox-Time, and N-MTLR algorithms could provide nonparametric alternatives to the Cox Proportional Hazards model in estimating the survival probability of patients with CRC. The transparent time-to-event ML models help clinicians to more accurately predict the survival rate for these patients and improve patient outcomes by enabling personalized treatment plans that are informed by explainable ML models.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] A retrospective cohort study of the influence of lifestyle factors on the survival of patients undergoing surgery for colorectal cancer
    Alexander, D.
    Allardice, G. M.
    Moug, S. J.
    Morrison, D. S.
    COLORECTAL DISEASE, 2017, 19 (06) : 544 - 550
  • [32] Predicting Metabolic Syndrome With Machine Learning Models Using a Decision Tree Algorithm: Retrospective Cohort Study
    Yu, Cheng-Sheng
    Lin, Yu-Jiun
    Lin, Chang-Hsien
    Wang, Sen-Te
    Lin, Shiyng-Yu
    Lin, Sanders H.
    Wu, Jenny L.
    Chang, Shy-Shin
    JMIR MEDICAL INFORMATICS, 2020, 8 (03)
  • [33] Predicting lung cancer survival based on clinical data using machine learning: A review
    Altuhaifa, Fatimah Abdulazim
    Win, Khin Than
    Su, Guoxin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 165
  • [34] A Machine Learning Approach for High-Dimensional Time-to-Event Prediction With Application to Immunogenicity of Biotherapies in the ABIRISK Cohort
    Duhaze, Julianne
    Hassler, Signe
    Bachelet, Delphine
    Gleizes, Aude
    Hacein-Bey-Abina, Salima
    Allez, Matthieu
    Deisenhammer, Florian
    Fogdell-Hahn, Anna
    Mariette, Xavier
    Pallardy, Marc
    Broet, Philippe
    FRONTIERS IN IMMUNOLOGY, 2020, 11
  • [35] A multi-omics machine learning framework in predicting the survival of colorectal cancer patients
    Yang, Min
    Yang, Huandong
    Ji, Lei
    Hu, Xuan
    Tian, Geng
    Wang, Bing
    Yang, Jialiang
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 146
  • [36] Predicting Survival of Tongue Cancer Patients by Machine Learning Models
    Vasilopoulos, Angelos
    Xi, Nan Miles
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, 2023, 3 (01): : 853 - 867
  • [37] Application of machine learning techniques for predicting survival in ovarian cancer
    Amir Sorayaie Azar
    Samin Babaei Rikan
    Amin Naemi
    Jamshid Bagherzadeh Mohasefi
    Habibollah Pirnejad
    Matin Bagherzadeh Mohasefi
    Uffe Kock Wiil
    BMC Medical Informatics and Decision Making, 22
  • [38] Survival machine learning model of T1 colorectal postoperative recurrence after endoscopic resection and surgical operation: a retrospective cohort study
    Li, Zhihong
    Aihemaiti, Yiliyaer
    Yang, Qianqian
    Ahemai, Yiliminuer
    Li, Zimei
    Du, Qianqian
    Wang, Yan
    Zhang, Hanxiang
    Cai, Yingbin
    BMC CANCER, 2025, 25 (01)
  • [39] Survival in familial colorectal cancer: a Danish cohort study
    Lautrup, Charlotte Kvist
    Mikkelsen, Ellen M.
    Lash, Timothy L.
    Katballe, Niels
    Sunde, Lone
    FAMILIAL CANCER, 2015, 14 (04) : 553 - 559
  • [40] Colorectal cancer survival rates in Makassar, Eastern Indonesia: A retrospective Cohort Study
    Labeda, Ibrahim
    Lusikooy, Ronald Erasio
    Mappincara
    Dani, Muhammad Iwan
    Sampetoding, Samuel
    Kusuma, Muhammad Ihwan
    Uwuratuw, Julianus Aboyaman
    Syarifuddin, Erwin
    Arsyad, Arham
    Faruk, Muhammad
    ANNALS OF MEDICINE AND SURGERY, 2022, 74