Development, validation, and feature extraction of a deep learning model predicting in-hospital mortality using Japan's largest national ICU database: a validation framework for transparent clinical Artificial Intelligence (cAI) development

被引:3
作者
Ishii, Euma [1 ]
Nawa, Nobutoshi [2 ]
Hashimoto, Satoru [3 ]
Shigemitsu, Hidenobu [4 ]
Fujiwara, Takeo [1 ,5 ]
机构
[1] Tokyo Med & Dent Univ, Dept Global Hlth Promot, Tokyo, Japan
[2] Tokyo Med & Dent Univ, Dept Med Educ Res & Dev, Tokyo, Japan
[3] Kyoto Prefectural Univ Med, Dept Anesthesiol & Intens Care Med, Kyoto, Japan
[4] Tokyo Med & Dent Univ, Inst Global Affairs, Tokyo, Japan
[5] Tokyo Med & Dent Univ TMDU, Dept Global Hlth Promot, 1-5-45 Yushima, Bunkyo-ku, Tokyo 1138519, Japan
关键词
Clinical decision support; Machine learning; Artificial Intelligence; Mortality prediction; Ethical artificial intelligence; INTENSIVE-CARE UNITS; APACHE-II; SAPS-II; EXTERNAL VALIDATION; ACUTE PHYSIOLOGY; SCORE; READMISSION; SEVERITY; STAY; RISK;
D O I
10.1016/j.accpm.2022.101167
中图分类号
R614 [麻醉学];
学科分类号
100217 ;
摘要
Objective: While clinical Artificial Intelligence (cAI) mortality prediction models and relevant studies have increased, limitations including the lack of external validation studies and inadequate model calibration leading to decreased overall accuracy have been observed. To combat this, we developed and evaluated a novel deep neural network (DNN) and a validation framework to promote transparent cAI development. Methods: Data from Japan's largest ICU database was used to develop the DNN model, predicting in-hospital mortality including ICU and post-ICU mortality by days since ICU discharge. The most important variables to the model were extracted with SHapley Additive exPlanations (SHAP) to examine the DNN's efficacy as well as develop models that were also externally validated. Main results: The area under the receiver operating characteristic curve (AUC) for predicting ICU mortality was 0.94 [0.93-0.95], and 0.91 [0.90-0.92] for in-hospital mortality, ranging between 0.91-0.95 throughout one year since ICU discharge. An external validation using only the top 20 variables resulted with higher AUCs than traditional severity scores. Conclusions: Our DNN model consistently generated AUCs between 0.91-0.95 regardless of days since ICU discharge. The 20 most important variables to our DNN, also generated higher AUCs than traditional severity scores regardless of days since ICU discharge. To our knowledge, this is the first study that predicts ICU and in-hospital mortality using cAI by post-ICU discharge days up to over a year. This finding could contribute to increased transparency on cAI applications. (C) 2022 Societe francaise d'anesthesie et de reanimation (Sfar). Published by Elsevier Masson SAS. All rights reserved.
引用
收藏
页数:7
相关论文
共 49 条
[1]   SAPS II revisited [J].
Aegerter, P ;
Boumendil, A ;
Retbi, A ;
Minvielle, E ;
Dervaux, B ;
Guidet, B .
INTENSIVE CARE MEDICINE, 2005, 31 (03) :416-423
[2]   The characteristics of very short stay ICU admissions and implications for optimizing ICU resource utilization: the Saudi experience [J].
Arabi, Y ;
Venkatesh, S ;
Haddad, S ;
Al Malik, S ;
Al Shimemeri, A .
INTERNATIONAL JOURNAL FOR QUALITY IN HEALTH CARE, 2004, 16 (02) :149-155
[3]   Early hospital mortality prediction of intensive care unit patients using an ensemble learning approach [J].
Awad, Aya ;
Bader-El-Den, Mohamed ;
McNicholas, James ;
Briggs, Jim .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2017, 108 :185-195
[4]  
Awad A, 2017, HEALTH SERV MANAG RE, V30, P105, DOI 10.1177/0951484817696212
[5]   External validation of the SAPS II, APACHE II and APACHE III prognostic models in South England: a multicentre study [J].
Beck, DH ;
Smith, GB ;
Pappachan, JV ;
Millar, B .
INTENSIVE CARE MEDICINE, 2003, 29 (02) :249-256
[6]   Validation of severity scoring systems SAPS II and APACHE II in a single-center population [J].
Capuzzo, M ;
Valpondi, V ;
Sgarbi, A ;
Bortolazzi, S ;
Pavoni, V ;
Gilli, G ;
Candini, G ;
Gritti, G ;
Alvisi, R .
INTENSIVE CARE MEDICINE, 2000, 26 (12) :1779-1785
[7]   Can we open the black box of AI? [J].
Castelvecchi D. .
Nature, 2016, 538 (7623) :20-23
[8]   Bridging the Health Data Divide [J].
Celi, Leo Anthony ;
Davidzon, Guido ;
Johnson, Alistair E. W. ;
Komorowski, Matthieu ;
Marshall, Dominic C. ;
Nair, Sunil S. ;
Phillips, Colin T. ;
Pollard, Tom J. ;
Raffa, Jesse D. ;
Salciccioli, Justin D. ;
Salgueiro, Francisco Muge ;
Stone, David J. .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2016, 18 (12)
[9]   A comparison of Child-Pugh, APACHE II and APACHE III scoring systems in predicting hospital mortality of patients with liver cirrhosis [J].
Chatzicostas, C ;
Roussomoustakaki, M ;
Notas, G ;
Vlachonikolis, IG ;
Samonakis, D ;
Romanos, J ;
Vardas, E ;
Kouroumalis, EA .
BMC GASTROENTEROLOGY, 2003, 3 (1)
[10]  
Chen Yung-Che, 2007, Chang Gung Med J, V30, P142