Explainable SHAP-XGBoost models for in-hospital mortality after myocardial infarction

被引:5
作者
Tarabanis, Constantine [1 ,4 ]
Kalampokis, Evangelos [2 ]
Khalil, Mahmoud [3 ]
Alviar, Carlos L. [1 ]
Chinitz, Larry A. [1 ]
Jankelson, Lior [1 ]
机构
[1] NYU, Leon H Charney Div Cardiol, NYU Langone Hlth, Sch Med, New York, NY USA
[2] Univ Macedonia, Informat Syst Lab, Thessaloniki, Greece
[3] Lincoln Med Ctr, Dept Internal Med, Bronx, NY USA
[4] NYU Langone Hlth, 564 1st Ave,Apt 18N, New York, NY 10016 USA
来源
CARDIOVASCULAR DIGITAL HEALTH JOURNAL | 2023年 / 4卷 / 04期
基金
美国医疗保健研究与质量局;
关键词
Acute coronary syndrome; Explainable machine learning; Myocardial infarction; In-hospital mortality; SHAP; PERCUTANEOUS CORONARY INTERVENTION; MACHINE LEARNING-MODELS; TIMI RISK SCORE; SEX; REGISTRY; IMPACT;
D O I
10.1016/j.cvdhj.2023.06.001
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
BACKGROUND A lack of explainability in published machine learning (ML) models limits clinicians' understanding of how predictions are made, in turn undermining uptake of the models into clinical practice.OBJECTIVE The purpose of this study was to develop explainable ML models to predict in-hospital mortality in patients hospitalized for myocardial infarction (MI).METHODS Adult patients hospitalized for an MI were identified in the National Inpatient Sample between January 1, 2012, and September 30, 2015. The resulting cohort comprised 457,096 patients described by 64 predictor variables relating to demographic/comorbidity characteristics and in-hospital complications. The gradient boosting algorithm eXtreme Gradient Boosting (XGBoost) was used to develop explainable models for in-hospital mortality prediction in the overall cohort and patient subgroups based on MI type and/or sex.RESULTS The resulting models exhibited an area under the receiver operating characteristic curve (AUC) ranging from 0.876 to 0.942, specificity 82% to 87%, and sensitivity 75% to 87%. All models exhibited high negative predictive value >0.974. The SHapley Additive exPlanation (SHAP) framework was applied to explain the models. The top predictor variables of increasing and decreasing mortality were age and undergoing percutaneous coronary intervention, respectively. Other notable findings included a decreased mortality risk associated with certain patient subpopulations with hyperlipidemia and a comparatively greater risk of death among women below age 55 years.CONCLUSION The literature lacks explainable ML models predicting in-hospital mortality after an MI. In a national registry, explainable ML models performed best in ruling out in-hospital death post MI, and their explanation illustrated their potential for guiding hypothesis generation and future study design.
引用
收藏
页码:126 / 132
页数:7
相关论文
共 38 条
[1]  
Agency for Healthcare Research and Quality, 2006, Using the HCUP Nationwide Inpatient Sample to Estimate Trends
[2]  
Agency for Healthcare Research and Quality. Healthcare Cost and Utilization Project (HCUP) National Inpatient Sample (NIS), 2012, Healthcare Cost and Utilization Project (HCUP) US National Inpatient Sample Overview
[3]   In-hospital mortality of habitual cigarette smokers after acute myocardial infarction - The 'smoker's paradox' in a countrywide study [J].
Andrikopoulos, GK ;
Richter, DJ ;
Dilaveris, PE ;
Pipilis, A ;
Zaharoulis, A ;
Gialafos, JE ;
Toutouzas, PK ;
Chimonas, ET .
EUROPEAN HEART JOURNAL, 2001, 22 (09) :776-784
[4]   The TIMI risk score for unstable angina/non-ST elevation MI - A method for prognostication and therapeutic decision making [J].
Antman, EM ;
Cohen, M ;
Bernink, PJLM ;
McCabe, CH ;
Horacek, T ;
Papuchis, G ;
Mautner, B ;
Corbalan, R ;
Radley, D ;
Braunwald, E .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2000, 284 (07) :835-842
[5]   Current Challenges and Future Opportunities for XAI in Machine Learning-Based Clinical Decision Support Systems: A Systematic Review [J].
Antoniadi, Anna Markella ;
Du, Yuhan ;
Guendouz, Yasmine ;
Wei, Lan ;
Mazo, Claudia ;
Becker, Brett A. ;
Mooney, Catherine .
APPLIED SCIENCES-BASEL, 2021, 11 (11)
[6]   A survey of cross-validation procedures for model selection [J].
Arlot, Sylvain ;
Celisse, Alain .
STATISTICS SURVEYS, 2010, 4 :40-79
[7]   The Role of Explanations on Trust and Reliance in Clinical Decision Support Systems [J].
Bussone, Adrian ;
Stumpf, Simone ;
O'Sullivan, Dympna .
2015 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2015), 2015, :160-169
[8]   Association of Age and Sex With Myocardial Infarction Symptom Presentation and In-Hospital Mortality [J].
Canto, John G. ;
Rogers, William J. ;
Goldberg, Robert J. ;
Peterson, Eric D. ;
Wenger, Nanette K. ;
Vaccarino, Viola ;
Kiefe, Catarina I. ;
Frederick, Paul D. ;
Sopko, George ;
Zheng, Zhi-Jie .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2012, 307 (08) :813-822
[9]   Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission [J].
Caruana, Rich ;
Lou, Yin ;
Gehrke, Johannes ;
Koch, Paul ;
Sturm, Marc ;
Elhadad, Noemie .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :1721-1730
[10]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794