Development of interpretable machine learning models to predict in-hospital prognosis of acute heart failure patients

Times cited: 1
Authors
Tanaka, Munekazu [1,2]
Kohjitani, Hirohiko [1,2]
Yamamoto, Erika [1]
Morimoto, Takeshi [3]
Kato, Takao [1]
Yaku, Hidenori [1]
Inuzuka, Yasutaka [4]
Tamaki, Yodo [5]
Ozasa, Neiko [1]
Seko, Yuta [1]
Shiba, Masayuki [1]
Yoshikawa, Yusuke [1]
Yamashita, Yugo [1]
Kitai, Takeshi [6]
Taniguchi, Ryoji [7]
Iguchi, Moritake [8]
Nagao, Kazuya [9]
Kawai, Takafumi [10]
Komasa, Akihiro [11]
Kawase, Yuichi [12]
Morinaga, Takashi [13]
Toyofuku, Mamoru [14]
Furukawa, Yutaka [15]
Ando, Kenji [13]
Kadota, Kazushige [12]
Sato, Yukihito [7]
Kuwahara, Koichiro [16]
Okuno, Yasushi [2]
Kimura, Takeshi [1,17]
Ono, Koh [1]
Affiliations
[1] Kyoto Univ, Grad Sch Med, Dept Cardiovasc Med, 54 Shogoin Kawahara Cho,Sakyo Ku, Kyoto 6068507, Japan
[2] Kyoto Univ, Grad Sch Med, Dept Artificial Intelligence Healthcare & Med, Kyoto, Japan
[3] Hyogo Coll Med, Dept Clin Epidemiol, Nishinomiya, Japan
[4] Shiga Gen Hosp, Dept Cardiovasc Med, Moriyama, Japan
[5] Tenri Hosp, Div Cardiol, Tenri, Japan
[6] Natl Cerebral & Cardiovasc Ctr, Dept Cardiovasc Med, Suita, Japan
[7] Hyogo Prefectural Amagasaki Gen Med Ctr, Dept Cardiol, Amagasaki, Japan
[8] Natl Hosp Org Kyoto Med Ctr, Dept Cardiol, Kyoto, Japan
[9] Osaka Red Cross Hosp, Dept Cardiol, Osaka, Japan
[10] Kishiwada City Hosp, Dept Cardiol, Kishiwada, Japan
[11] Kansai Elect Power Hosp, Dept Cardiol, Osaka, Japan
[12] Kurashiki Cent Hosp, Dept Cardiol, Kurashiki, Japan
[13] Kokura Mem Hosp, Dept Cardiol, Kitakyushu, Japan
[14] Japanese Red Cross Wakayama Med Ctr, Dept Cardiol, Wakayama, Japan
[15] Kobe City Med Ctr Gen Hosp, Dept Cardiovasc Med, Kobe, Japan
[16] Shinshu Univ, Grad Sch Med, Dept Cardiovasc Med, Matsumoto, Japan
[17] Hirakata Kohsai Hosp, Dept Cardiol, Hirakata, Japan
Keywords
Acute heart failure; Machine learning; Explainable model; SHAP; Decision tree model; RISK MODEL; CLASSIFICATION; MORTALITY; SCORE
DOI
10.1002/ehf2.14834
Chinese Library Classification number
R5 [Internal Medicine]
Discipline classification code
1002; 100201
Abstract
Aims: Machine learning (ML) models have developed remarkably in recent years and increasingly achieve high prediction performance. Such models are often structurally complex and are frequently perceived as black boxes, which hinders intuitive interpretation of their predictions. We aimed to develop ML models with high prediction performance, interpretability, and superior risk stratification to predict in-hospital mortality and worsening heart failure (WHF) in patients with acute heart failure (AHF).

Methods and results: Based on the Kyoto Congestive Heart Failure registry, which enrolled 4056 patients with AHF, we developed prediction models for in-hospital mortality and WHF using information obtained on the first day of admission (demographics, physical examination, blood test results, etc.). After excluding 16 patients who died on the first or second day of admission, the original dataset (n = 4040) was split 4:1 into training (n = 3232) and test (n = 808) datasets. Based on the training dataset, we developed three types of prediction models: (i) the classification and regression trees (CART) model; (ii) the random forest (RF) model; and (iii) the extreme gradient boosting (XGBoost) model. The performance of each model was evaluated on the test dataset using metrics including sensitivity, specificity, area under the receiver operating characteristic curve (AUC), Brier score, and calibration slope. Because the XGBoost model is structurally complex, we applied SHapley Additive exPlanations (SHAP) analysis to it and classified patients into interpretable clusters. In the original dataset, the proportion of females was 44.8% (1809/4040), and the average age was 77.9 ± 12.0 years. In the total study population, the in-hospital mortality rate was 6.3% (255/4040) and the WHF rate was 22.3% (900/4040). For in-hospital mortality prediction, the AUC of the XGBoost model was 0.816 [95% confidence interval (CI): 0.815-0.818], surpassing the AUC of the CART model (0.683, 95% CI: 0.680-0.685) and the RF model (0.755, 95% CI: 0.753-0.757). Similarly, for WHF prediction, the AUC of the XGBoost model was 0.766 (95% CI: 0.765-0.768), outperforming the CART model (0.688, 95% CI: 0.686-0.689) and the RF model (0.713, 95% CI: 0.711-0.714). In the XGBoost model, interpretable clusters were formed, and within each cluster the rates of in-hospital mortality and WHF were similar between the training and test datasets.

Conclusions: The XGBoost models with SHAP analysis provide high prediction performance, interpretability, and reproducible risk stratification for in-hospital mortality and WHF in patients with AHF.
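The split and two of the evaluation metrics named in the abstract (AUC and Brier score) can be illustrated with a minimal, standard-library-only Python sketch. This is not the authors' code and trains no model. The function names (`split_4_to_1`, `roc_auc`, `brier_score`), the random seed, and the four-patient toy data are all assumptions made for illustration; only the cohort size of 4040 and the 4:1 ratio come from the abstract.

```python
import random


def split_4_to_1(n_rows, seed=0):
    """Shuffle row indices and split them 4:1 into training and test sets.

    Hypothetical helper: the abstract gives the ratio, not the procedure.
    """
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    n_train = n_rows * 4 // 5
    return indices[:n_train], indices[n_train:]


def roc_auc(y_true, y_prob):
    """AUC as the probability that a randomly chosen positive case receives
    a higher predicted risk than a randomly chosen negative case
    (ties count as one half)."""
    positives = [p for y, p in zip(y_true, y_prob) if y == 1]
    negatives = [p for y, p in zip(y_true, y_prob) if y == 0]
    wins = 0.0
    for p in positives:
        for q in negatives:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


def brier_score(y_true, y_prob):
    """Mean squared difference between predicted risk and observed outcome."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


# Cohort size from the abstract: 4040 patients after exclusions.
train_idx, test_idx = split_4_to_1(4040)
print(len(train_idx), len(test_idx))  # 3232 808

# Invented toy outcomes (1 = event) and predicted risks, for illustration only.
y_true = [0, 0, 1, 1]
y_prob = [0.1, 0.4, 0.35, 0.8]
print(roc_auc(y_true, y_prob))  # 0.75
print(brier_score(y_true, y_prob))
```

A higher AUC means better discrimination, while a lower Brier score means predicted risks are closer to observed outcomes. The pairwise AUC loop is quadratic in the number of cases and is written for readability rather than speed.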
Pages: 2481 / +
Page count: 977