Development, validation, and feature extraction of a deep learning model predicting in-hospital mortality using Japan's largest national ICU database: a validation framework for transparent clinical Artificial Intelligence (cAI) development

被引:3
作者
Ishii, Euma [1 ]
Nawa, Nobutoshi [2 ]
Hashimoto, Satoru [3 ]
Shigemitsu, Hidenobu [4 ]
Fujiwara, Takeo [1 ,5 ]
机构
[1] Tokyo Med & Dent Univ, Dept Global Hlth Promot, Tokyo, Japan
[2] Tokyo Med & Dent Univ, Dept Med Educ Res & Dev, Tokyo, Japan
[3] Kyoto Prefectural Univ Med, Dept Anesthesiol & Intens Care Med, Kyoto, Japan
[4] Tokyo Med & Dent Univ, Inst Global Affairs, Tokyo, Japan
[5] Tokyo Med & Dent Univ TMDU, Dept Global Hlth Promot, 1-5-45 Yushima, Bunkyo-ku, Tokyo 1138519, Japan
关键词
Clinical decision support; Machine learning; Artificial Intelligence; Mortality prediction; Ethical artificial intelligence; INTENSIVE-CARE UNITS; APACHE-II; SAPS-II; EXTERNAL VALIDATION; ACUTE PHYSIOLOGY; SCORE; READMISSION; SEVERITY; STAY; RISK;
D O I
10.1016/j.accpm.2022.101167
中图分类号
R614 [麻醉学];
学科分类号
100217 ;
摘要
Objective: While clinical Artificial Intelligence (cAI) mortality prediction models and relevant studies have increased, limitations including the lack of external validation studies and inadequate model calibration leading to decreased overall accuracy have been observed. To combat this, we developed and evaluated a novel deep neural network (DNN) and a validation framework to promote transparent cAI development. Methods: Data from Japan's largest ICU database was used to develop the DNN model, predicting in-hospital mortality including ICU and post-ICU mortality by days since ICU discharge. The most important variables to the model were extracted with SHapley Additive exPlanations (SHAP) to examine the DNN's efficacy as well as develop models that were also externally validated. Main results: The area under the receiver operating characteristic curve (AUC) for predicting ICU mortality was 0.94 [0.93-0.95], and 0.91 [0.90-0.92] for in-hospital mortality, ranging between 0.91-0.95 throughout one year since ICU discharge. An external validation using only the top 20 variables resulted with higher AUCs than traditional severity scores. Conclusions: Our DNN model consistently generated AUCs between 0.91-0.95 regardless of days since ICU discharge. The 20 most important variables to our DNN, also generated higher AUCs than traditional severity scores regardless of days since ICU discharge. To our knowledge, this is the first study that predicts ICU and in-hospital mortality using cAI by post-ICU discharge days up to over a year. This finding could contribute to increased transparency on cAI applications. (C) 2022 Societe francaise d'anesthesie et de reanimation (Sfar). Published by Elsevier Masson SAS. All rights reserved.
引用
收藏
页数:7
相关论文
共 49 条
[21]   Prediction of Clinical Deterioration in Hospitalized Adult Patients with Hematologic Malignancies Using a Neural Network Model [J].
Hu, Scott B. ;
Wong, Deborah J. L. ;
Correa, Aditi ;
Li, Ning ;
Deng, Jane C. .
PLOS ONE, 2016, 11 (08)
[22]   Early prediction of circulatory failure in the intensive care unit using machine learning [J].
Hyland, Stephanie L. ;
Faltys, Martin ;
Huser, Matthias ;
Lyu, Xinrui ;
Gumbsch, Thomas ;
Esteban, Cristobal ;
Bock, Christian ;
Horn, Max ;
Moor, Michael ;
Rieck, Bastian ;
Zimmermann, Marc ;
Bodenham, Dean ;
Borgwardt, Karsten ;
Ratsch, Gunnar ;
Merz, Tobias M. .
NATURE MEDICINE, 2020, 26 (03) :364-+
[23]   The Japanese Intensive care PAtient Database (JIPAD): A national intensive care unit registry in Japan [J].
Irie, Hiromasa ;
Okamoto, Hiroshi ;
Uchino, Shigehiko ;
Endo, Hideki ;
Uchida, Masatoshi ;
Kawasaki, Tatsuya ;
Kumasawa, Junji ;
Tagami, Takashi ;
Shigemitsu, Hidenobu ;
Hashiba, Eiji ;
Aoki, Yoshitaka ;
Kurosawa, Hiroshi ;
Hatakeyama, Junji ;
Ichihara, Nao ;
Hashimoto, Satoru ;
Nishimura, Masaji .
JOURNAL OF CRITICAL CARE, 2020, 55 :86-94
[24]  
Johnson AE., 2017, MACH LEARN HEALTHC C
[25]   With an eye to AI and autonomous diagnosis [J].
Keane, Pearse A. ;
Topol, Eric J. .
NPJ DIGITAL MEDICINE, 2018, 1
[26]   A Comparison of Intensive Care Unit Mortality Prediction Models through the Use of Data Mining Techniques [J].
Kim, Sujin ;
Kim, Woojae ;
Park, Rae Woong .
HEALTHCARE INFORMATICS RESEARCH, 2011, 17 (04) :232-243
[27]   APACHE - ACUTE PHYSIOLOGY AND CHRONIC HEALTH EVALUATION - A PHYSIOLOGICALLY BASED CLASSIFICATION-SYSTEM [J].
KNAUS, WA ;
ZIMMERMAN, JE ;
WAGNER, DP ;
DRAPER, EA ;
LAWRENCE, DE .
CRITICAL CARE MEDICINE, 1981, 9 (08) :591-597
[28]   Artificial Intelligence in Precision Cardiovascular Medicine [J].
Krittanawong, Chayakrit ;
Zhang, HongJu ;
Wang, Zhen ;
Aydar, Mehmet ;
Kitai, Takeshi .
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2017, 69 (21) :2657-2664
[29]   Performance of a Deep-Learning Neural Network Model in Assessing Skeletal Maturity on Pediatric Hand Radiographs [J].
Larson, David B. ;
Chen, Matthew C. ;
Lungren, Matthew P. ;
Halabi, Safwan S. ;
Stence, Nicholas V. ;
Langlotz, Curtis P. .
RADIOLOGY, 2018, 287 (01) :313-322
[30]   SAPS 3 admission score: an external validation in a general intensive care population [J].
Ledoux, Didier ;
Canivet, Jean-Luc ;
Preiser, Jean-Charles ;
Lefrancq, Joealle ;
Damas, Pierre .
INTENSIVE CARE MEDICINE, 2008, 34 (10) :1873-1877