Impact of Tumor Location on Predicting Early-Stage Breast Cancer Patient Survivability Using Explainable Machine Learning Models

被引:0
作者
Abdalnabi, Nader [1 ]
Adebiyi, Abdulmateen [2 ]
Alhonainy, Ahmad [2 ]
Naha, Kushal [3 ]
Papageorgiou, Christos [3 ,4 ]
Rao, Praveen [1 ,2 ]
机构
[1] MU Inst Data Sci & Informat, Columbia, MO 65211 USA
[2] Univ Missouri, Dept Elect Engn & Comp Sci, Columbia, MO 65201 USA
[3] Univ Missouri, Dept Med, Columbia, MO USA
[4] Ellis Fischel Canc Ctr, Columbia, MO USA
关键词
D O I
10.1200/CCI-24-00178
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
PURPOSEThis study aims to investigate the impact of tumor quadrant location on the 5-year early-stage breast cancer survivability prediction using explainable machine learning (ML) models. By integrating these predictive models with Shapley Additive Explanations (SHAP), feature importance, and coefficient effect size, we aim to provide insights into the significant factors influencing patient outcomes.METHODSData from 401 early-stage patients with breast cancer at the University of Missouri's Ellis Fischel Cancer Center were used, encompassing 20 variables related to demographics, tumor characteristics, and therapeutics. Six ML models, namely, Xtreme Gradient Boosting, Random Forest classifier, Logistic Regression, Decision Tree classifier (DT), Support Vector Machine classifier, and AdaBoost (ADB), were trained and evaluated using various performance metrics, including accuracy, sensitivity, specificity, F1-score, area under the receiver operating characteristic curve (AUC-ROC), and area under the precision-recall curve (AUC-PR). Feature importance, coefficient effect size, and SHAP values were used to interpret and visualize the importance of different features, particularly focusing on tumor quadrant variables.RESULTSThe extreme gradient boosting model outperformed other models, achieving an AUC-ROC score of 0.98 and an AUC-PR score of 0.97. The analysis revealed that tumor quadrant variables, especially the upper outer and miscellaneous or overlapping sites, were among the top predictive features for breast cancer survivability. SHAP analysis further highlighted the significance of these tumor locations in influencing survival outcomes.CONCLUSIONThis study demonstrates the efficacy of explainable ML models in predicting 5-year early-stage breast cancer survivability and identifies tumor quadrant location as an independent prognostic factor. The use of SHAP values provides a clear interpretation of the model's predictions, offering valuable insights for clinicians to refine treatment protocols and improve patient outcomes.
引用
收藏
页数:11
相关论文
共 22 条
[1]  
[Anonymous], DEEP LEARNING
[2]  
breastcancer, 2023, Breast Cancer Facts and Statistics
[3]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[4]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[5]   Integration of Machine Learning and Blockchain Technology in the Healthcare Field: A Literature Review and Implications for Cancer Care [J].
Cheng, Andy S. K. ;
Guan, Qiongyao ;
Su, Yan ;
Zhou, Ping ;
Zeng, Yingchun .
ASIA-PACIFIC JOURNAL OF ONCOLOGY NURSING, 2021, 8 (06) :720-724
[6]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[7]   Effect of Primary Breast Tumor Location on Axillary Nodal Positivity [J].
Desai, Amita A. ;
Hoskin, Tanya L. ;
Day, Courtney N. ;
Habermann, Elizabeth B. ;
Boughey, Judy C. .
ANNALS OF SURGICAL ONCOLOGY, 2018, 25 (10) :3011-3018
[8]   Applications of Blockchain Technology for Data-Sharing in Oncology: Results from a Systematic Literature Review [J].
Dubovitskaya, Alevtina ;
Novotny, Petr ;
Xu, Zhigang ;
Wang, Fusheng .
ONCOLOGY, 2020, 98 (06) :403-411
[9]   Triple-Negative Breast Cancer [J].
Foulkes, William D. ;
Smith, Ian E. ;
Reis-Filho, Jorge S. .
NEW ENGLAND JOURNAL OF MEDICINE, 2010, 363 (20) :1938-1948
[10]   Real-world outcomes for Chinese breast cancer patients with tumor location of central and nipple portion [J].
Fu, Wei-Da ;
Wang, Xiao-Hui ;
Lu, Kang-Kang ;
Lu, Yi-Qiao ;
Zhou, Jie-Yu ;
Huang, Qi-Di ;
Guo, Gui-Long .
FRONTIERS IN SURGERY, 2022, 9