Development and Validation of an Interpretable Conformal Predictor to Predict Sepsis Mortality Risk: Retrospective Cohort Study

Cited by: 1
Authors
Yang, Meicheng [1]
Chen, Hui [2]
Hu, Wenhan [2]
Mischi, Massimo [3]
Shan, Caifeng [4,5]
Li, Jianqing [1,6]
Long, Xi [3]
Liu, Chengyu [1,7]
Affiliations
[1] Southeast Univ, Sch Instrument Sci & Engn, State Key Lab Digital Med Engn, Nanjing, Peoples R China
[2] Southeast Univ, Zhongda Hosp, Dept Crit Care Med, Jiangsu Prov Key Lab Crit Care Med, Nanjing, Peoples R China
[3] Eindhoven Univ Technol, Dept Elect Engn, Eindhoven, Netherlands
[4] Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao, Peoples R China
[5] Nanjing Univ, Sch Intelligence Sci & Technol, Nanjing, Peoples R China
[6] Nanjing Med Univ, Sch Biomed Engn & Informat, Nanjing, Peoples R China
[7] Southeast Univ, Sch Instrument Sci & Engn, State Key Lab Digital Med Engn, 35 Jinxianghe Rd, Nanjing 210096, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
sepsis; critical care; clinical decision-making; mortality prediction; conformal prediction; hospital mortality
DOI
10.2196/50369
Chinese Library Classification number
R19 [Health care organization and services (health services administration)]
Abstract
Background: Early and reliable identification of patients with sepsis who are at high risk of mortality is important to improve clinical outcomes. However, 3 major barriers, namely the lack of interpretability, limited generalizability, and the risk of automation bias, hinder the widespread adoption of artificial intelligence (AI) models in clinical practice.
Objective: This study aimed to develop and validate (internally and externally) a conformal predictor of sepsis mortality risk in patients who are critically ill, leveraging AI-assisted prediction modeling. The proposed approach makes it possible to explain the model output and to assess its confidence level.
Methods: We retrospectively extracted data on adult patients with sepsis from a database collected at Beth Israel Deaconess Medical Center, a teaching hospital, for model training and internal validation. A large multicenter critical care database from the Philips eICU Research Institute was used for external validation. A total of 103 clinical features were extracted from the first day after admission. We developed an AI model using gradient-boosting machines to predict the mortality risk of sepsis and used Mondrian conformal prediction to estimate the prediction uncertainty. The Shapley additive explanation method was used to explain the model.
Results: A total of 16,746 (80%) patients from Beth Israel Deaconess Medical Center were used to train the model. When tested on the internal validation population of 4187 (20%) patients, the model achieved an area under the receiver operating characteristic curve of 0.858 (95% CI 0.845-0.871), which was reduced to 0.800 (95% CI 0.789-0.811) when externally validated on 10,362 patients from the Philips eICU database. At a specified confidence level of 90% for the internal validation cohort, the percentage of error predictions (n=438) out of all predictions (n=4187) was 10.5%, with 1229 (29.4%) predictions requiring clinician review. In contrast, the AI model without conformal prediction made 1449 (34.6%) errors. When externally validated, more predictions (n=4004, 38.6%) were flagged for clinician review due to interdatabase heterogeneity. Nevertheless, the model still produced a significantly lower error rate than the point predictions by AI (n=1221, 11.8% vs n=4540, 43.8%). The most important predictors identified in this predictive model were Acute Physiology Score III, age, urine output, vasopressors, and pulmonary infection. Clinically relevant risk factors contributing to the prediction for a single patient were also examined to show how the risk arose.
Conclusions: By combining model explanation and conformal prediction, AI-based systems can be better translated into medical practice for clinical decision-making.
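The abstract names Mondrian conformal prediction but does not spell out the procedure. The following is a minimal sketch of a generic Mondrian (class-conditional) inductive conformal predictor for a binary outcome, not the authors' implementation. It assumes a calibration set of predicted mortality probabilities with known outcomes, uses 1 minus the probability assigned to a label as the nonconformity score, and assumes that any prediction set that is not a single label is the kind of case flagged for clinician review. All function names and the toy calibration values are hypothetical.

```python
def mondrian_p_values(cal_probs, cal_labels, test_prob):
    """Class-conditional conformal p-values for one test case.

    cal_probs  -- predicted probability of class 1 (death) for calibration cases
    cal_labels -- true labels (0 or 1) for the same calibration cases
    test_prob  -- predicted probability of class 1 for the new case
    """
    p_values = {}
    for label in (0, 1):
        # Nonconformity score: 1 minus the probability assigned to `label`,
        # computed only on calibration cases whose true class is `label`
        # (this per-class calibration is what makes the predictor "Mondrian").
        cal_scores = [
            (1 - p) if label == 1 else p
            for p, y in zip(cal_probs, cal_labels)
            if y == label
        ]
        test_score = (1 - test_prob) if label == 1 else test_prob
        n_at_least = sum(s >= test_score for s in cal_scores)
        p_values[label] = (n_at_least + 1) / (len(cal_scores) + 1)
    return p_values


def prediction_set(p_values, confidence=0.90):
    """Keep every label whose p-value exceeds the significance level."""
    epsilon = 1 - confidence
    return {label for label, p in p_values.items() if p > epsilon}


def triage(pred_set):
    """Assumed rule: only single-label sets count as confident predictions."""
    return "confident" if len(pred_set) == 1 else "review"


# Toy calibration data (hypothetical values, 10 cases per class).
cal_probs = [0.02, 0.05, 0.08, 0.10, 0.12, 0.15, 0.20, 0.25, 0.30, 0.55,
             0.35, 0.60, 0.65, 0.70, 0.75, 0.80, 0.85, 0.90, 0.92, 0.95]
cal_labels = [0] * 10 + [1] * 10

high_risk = prediction_set(mondrian_p_values(cal_probs, cal_labels, 0.97))
low_risk = prediction_set(mondrian_p_values(cal_probs, cal_labels, 0.01))
ambiguous = prediction_set(mondrian_p_values(cal_probs, cal_labels, 0.45))
```

In this toy setup the cases with predicted probabilities 0.97 and 0.01 each receive a single-label set, while the case at 0.45 receives both labels and would be routed to review. With only 10 calibration cases per class, the smallest attainable p-value is 1/11, so a 90% confidence level is close to the strictest level this toy calibration set can support.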
Pages: 16