A survival prediction model via interpretable machine learning for patients with oropharyngeal cancer following radiotherapy

被引:9
作者
Pan, Xiaoying [1 ,2 ]
Feng, Tianhao [1 ,2 ]
Liu, Chen [1 ,2 ]
Savjani, Ricky R. [3 ]
Chin, Robert K. [3 ]
Qi, X. Sharon [3 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, Xian 710121, Peoples R China
[2] Xian Univ Posts & Telecommun, Shaanxi Key Lab Network Data Anal & Intelligent Pr, Xian 710121, Peoples R China
[3] Univ Calif Los Angeles, Dept Radiat Oncol, Los Angeles, CA 90095 USA
基金
中国国家自然科学基金;
关键词
Oropharyngeal cancer; Machine learning; Survival analysis; Personalized treatment; Interpretability; RADIOMICS; SELECTION; IMAGES;
D O I
10.1007/s00432-023-04644-y
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
PurposeTo explore interpretable machine learning (ML) methods, with the hope of adding more prognosis value, for predicting survival for patients with Oropharyngeal-Cancer (OPC).MethodsA cohort of 427 OPC patients (Training 341, Test 86) from TCIA database was analyzed. Radiomic features of gross-tumor-volume (GTV) extracted from planning CT using Pyradiomics, and HPV p16 status, etc. patient characteristics were considered as potential predictors. A multi-level dimension reduction algorithm consisting of Least-Absolute-Selection-Operator (Lasso) and Sequential-Floating-Backward-Selection (SFBS) was proposed to effectively remove redundant/irrelevant features. The interpretable model was constructed by quantifying the contribution of each feature to the Extreme-Gradient-Boosting (XGBoost) decision by Shapley-Additive-exPlanations (SHAP) algorithm.ResultsThe Lasso-SFBS algorithm proposed in this study finally selected 14 features, and our prediction model achieved an area-under-ROC-curve (AUC) of 0.85 on the test dataset based on this feature set. The ranking of the contribution values calculated by SHAP shows that the top predictors that were most correlated with survival were ECOG performance status, wavelet-LLH_firstorder_Mean, chemotherapy, wavelet-LHL_glcm_InverseVariance, tumor size. Those patients who had chemotherapy, with positive HPV p16 status, and lower ECOG performance status, tended to have higher SHAP scores and longer survival; who had an older age at diagnosis, heavy drinking and smoking pack year history, tended to lower SHAP scores and shorter survival.ConclusionWe demonstrated predictive values of combined patient characteristics and imaging features for the overall survival of OPC patients. The multi-level dimension reduction algorithm can reliably identify the most plausible predictors that are mostly associated with overall survival. The interpretable patient-specific survival prediction model, capturing correlations of each predictor and clinical outcome, was developed to facilitate clinical decision-making for personalized treatment.
引用
收藏
页码:6813 / 6825
页数:13
相关论文
共 50 条
[31]   Prediction of the Fatigue Strength of Steel Based on Interpretable Machine Learning [J].
Liu, Chengcheng ;
Wang, Xuandong ;
Cai, Weidong ;
Yang, Jiahui ;
Su, Hang .
MATERIALS, 2023, 16 (23)
[32]   Interpretable Machine Learning Models for Prediction of UHPC Creep Behavior [J].
Zhu, Peng ;
Cao, Wenshuo ;
Zhang, Lianzhen ;
Zhou, Yongjun ;
Wu, Yuching ;
Ma, Zhongguo John .
BUILDINGS, 2024, 14 (07)
[33]   Breast cancer prediction based on gene expression data using interpretable machine learning techniques [J].
Kallah-Dagadu, Gabriel ;
Mohammed, Mohanad ;
Nasejje, Justine B. ;
Mchunu, Nobuhle Nokubonga ;
Twabi, Halima S. ;
Batidzirai, Jesca Mercy ;
Singini, Geoffrey Chiyuzga ;
Nevhungoni, Portia ;
Maposa, Innocent .
SCIENTIFIC REPORTS, 2025, 15 (01)
[34]   Prediction of lung cancer patient survival via supervised machine learning classification techniques [J].
Lynch, Chip M. ;
Abdollahi, Behnaz ;
Fuqua, Joshua D. ;
de Carlo, Alexandra R. ;
Bartholomai, James A. ;
Balgemann, Rayeanne N. ;
van Berkel, Victor H. ;
Frieboes, Hermann B. .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2017, 108 :1-8
[35]   Lung Cancer Survival Prediction via Machine Learning Regression, Classification, and Statistical Techniques [J].
Bartholomai, James A. ;
Frieboes, Hermann B. .
2018 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2018, :632-637
[36]   Developing an Interpretable Machine Learning Model for Early Prediction of Cardiovascular Involvement in Systemic Lupus Erythematosus [J].
Deng, Zixian ;
Liu, Huadong ;
Chen, Feng ;
Liu, Qiyun ;
Wang, Xiaoyu ;
Wang, Caiping ;
Lyu, Chuangye ;
Li, Jianghua ;
Li, Tangzhiming .
JOURNAL OF INFLAMMATION RESEARCH, 2025, 18 :8629-8641
[37]   Machine Learning Applied to survival prediction of elderly cancer patients: Systematic Review [J].
Lacerda, Tiago Beltrao ;
Medeiros, Alberto ;
Perez, Regis Batista ;
Cavalcanti Furtado, Ana Paula .
2020 15TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI'2020), 2020,
[38]   Interpretable machine learning models for crime prediction [J].
Zhang, Xu ;
Liu, Lin ;
Lan, Minxuan ;
Song, Guangwen ;
Xiao, Luzi ;
Chen, Jianguo .
COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2022, 94
[39]   Survival Prediction in Gallbladder Cancer Using CT Based Machine Learning [J].
Liu, Zefan ;
Zhu, Guannan ;
Jiang, Xian ;
Zhao, Yunuo ;
Zeng, Hao ;
Jing, Jing ;
Ma, Xuelei .
FRONTIERS IN ONCOLOGY, 2020, 10
[40]   Multimodal survival prediction in advanced pancreatic cancer using machine learning [J].
Keyl, J. ;
Kasper, S. ;
Wiesweg, M. ;
Goetze, J. ;
Schoenrock, M. ;
Sinn, M. ;
Berger, A. ;
Nasca, E. ;
Kostbade, K. ;
Schumacher, B. ;
Markus, P. ;
Albers, D. ;
Treckmann, J. ;
Schmid, K. W. ;
Schildhaus, H-U ;
Siveke, J. T. ;
Schuler, M. ;
Kleesiek, J. .
ESMO OPEN, 2022, 7 (05)