Time-dependent interpretable survival prediction model for second primary NSCLC patients

被引:0
作者
Luo, Qiong [1 ]
Zhang, Qianyuan [2 ]
Liu, Haiyu [3 ]
Chen, Xiangqi [3 ]
Yang, Sheng [1 ]
Xu, Qian [1 ]
机构
[1] Fujian Med Univ Union Hosp, Dept Oncol Med, Fuzhou 350001, Peoples R China
[2] Fujian Med Univ Union Hosp, Dept Gen Med, Fuzhou 350001, Peoples R China
[3] Fujian Med Univ Union Hosp, Dept Pulm & Crit Care Med, Fuzhou 350001, Peoples R China
关键词
Second primary non-small cell lung cancer; Machine learning; Overall survival prediction; Time-dependent interpretability; Surgery; PRIMARY LUNG-CANCER; DIAGNOSIS; PROGNOSIS;
D O I
10.1016/j.ijmedinf.2024.105771
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: Accurate predictive models for second primary non-small cell lung cancer (SP-NSCLC) are limited. This study aimed to develop and validate overall survival (OS) prediction models for SP-NSCLC patients using timedependent interpretable survival machine learning algorithms. Methods: This study utilized data from the Surveillance, Epidemiology, and End Results (SEER) database, encompassing 8 and 12 registries, to extract data on patients aged 20-89 diagnosed with SP-NSCLC between 1988 and 2020. The dataset was divided into development, external temporal and spatial validation cohorts. Predictors included demographic, clinical, pathological and initial primary cancer-related features. Multiple survival machine learning algorithms were developed and validated, assessing model performance using C-index, time-dependent area under the receiver operating characteristic curve (time-AUC), and time-dependent Brier Score. The time-dependent interpretability analysis was employed to explore the time-dependent feature importance of key predictors. Results: The Blackboost model demonstrated excellent performance (C-index: 0.7517, and time-AUC: 0.8438), and good calibration (time-Brier Score of 0.0754). External validations and subgroup analyses demonstrated the robustness, generalizability, and fairness. Utilizing the optimal cutoff threshold, high-risk groups could be effectively identified. Surgery was the most critical predictor across the entire survival period. Combined stage (distant) and chemotherapy were the second most important predictors within 0 to 5 years, while age replaced from 5 to 20 years. Additionally, we developed an online visualization tool. Conclusions: The Blackboost survival model achieved accurate, fair, and robust survival prediction for SP-NSCLC patients. Surgery, combined stage (distant), chemotherapy, and age contributed differently across various survival periods. The online visualization tool facilitated personalized survival predictions.
引用
收藏
页数:9
相关论文
共 51 条
  • [1] Explainable deep learning-based survival prediction for non-small cell lung cancer patients undergoing radical radiotherapy
    Astley, Joshua R.
    Reilly, James M.
    Robinson, Stephen
    Wild, Jim M.
    Hatton, Matthew Q.
    Tahir, Bilal A.
    [J]. RADIOTHERAPY AND ONCOLOGY, 2024, 193
  • [2] Anatomic resection has superior long-term survival compared with wedge resection for second primary lung cancer after prior lobectomy
    Baig, Mirza Zain
    Razi, Syed S.
    Stroever, Stephanie
    Weber, Joanna F.
    Connery, Cliff P.
    Bhora, Faiz Y.
    [J]. EUROPEAN JOURNAL OF CARDIO-THORACIC SURGERY, 2021, 59 (05) : 1014 - 1020
  • [3] Boosting for high-dimensional time-to-event data with competing risks
    Binder, Harald
    Allignol, Arthur
    Schumacher, Martin
    Beyersmann, Jan
    [J]. BIOINFORMATICS, 2009, 25 (07) : 890 - 896
  • [4] Boosting with the L2 loss:: Regression and classification
    Bühlmann, P
    Yu, B
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2003, 98 (462) : 324 - 339
  • [5] The need to separate the wheat from the chaff in medical informatics Introducing a comprehensive checklist for the (self)-assessment of medical AI studies
    Cabitza, Federico
    Campagner, Andrea
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 153
  • [6] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [7] The Survival Impact of Second Primary Lung Cancer in Patients With Lung Cancer
    Choi, Eunji
    Luo, Sophia J.
    Aredo, Jacqueline, V
    Backhus, Leah M.
    Wilkens, Lynne R.
    Su, Chloe C.
    Neal, Joel W.
    Le Marchand, Loic
    Cheng, Iona
    Wakelee, Heather A.
    Han, Summer S.
    [J]. JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2022, 114 (04): : 618 - 625
  • [8] Collins GS, 2015, J CLIN EPIDEMIOL, V68, P112, DOI [10.1016/j.jclinepi.2014.11.010, 10.1186/s12916-014-0241-z, 10.1002/bjs.9736, 10.1038/bjc.2014.639, 10.1016/j.eururo.2014.11.025, 10.7326/M14-0697, 10.7326/M14-0698, 10.1136/bmj.g7594]
  • [9] Lung cancer survival period prediction and understanding: Deep learning approaches
    Doppalapudi, Shreyesh
    Qiu, Robin G.
    Badr, Youakim
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 148
  • [10] Survival in Patients with Metachronous Second Primary Lung Cancer
    Duc Ha
    Choi, Humberto
    Chevalier, Cory
    Zell, Katrina
    Wang, Xiao-Feng
    Mazzone, Peter J.
    [J]. ANNALS OF THE AMERICAN THORACIC SOCIETY, 2015, 12 (01) : 79 - 84