Early prediction of clinical deterioration using data-driven machine-learning modeling of electronic health records

被引:15
|
作者
Ruiz, Victor M. [1 ]
Goldsmith, Michael P. [2 ,3 ]
Shi, Lingyun [1 ]
Simpao, Allan F. [2 ,3 ]
Galvez, Jorge A. [2 ,3 ]
Naim, Maryam Y. [2 ,3 ]
Nadkarni, Vinay [2 ,3 ]
Gaynor, J. William [2 ,3 ]
Tsui, Fuchiang [1 ,2 ,3 ]
机构
[1] Childrens Hosp Philadelphia, Dept Biomed & Hlth Informat, Tsui Lab, Philadelphia, PA 19146 USA
[2] Childrens Hosp Philadelphia, Dept Anesthesiol & Crit Care Med, Philadelphia, PA 19146 USA
[3] Univ Penn, Pereleman Sch Med, Philadelphia, PA 19104 USA
基金
美国国家卫生研究院; 美国安德鲁·梅隆基金会;
关键词
machine learning; electronic health records; univentricular heart; extracorporeal membrane oxygenation; cardiopulmonary resuscitation; intubation; intratracheal; CARDIAC-ARREST; CHILDREN; NAMES; SCORE;
D O I
10.1016/j.jtcvs.2021.10.060
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objectives: To develop and evaluate a high-dimensional, data-driven model to identify patients at high risk of clinical deterioration from routinely collected electronic health record (EHR) data. Materials and Methods: In this single-center, retrospective cohort study, 488 patients with single-ventricle and shunt-dependent congenital heart disease <6 months old were admitted to the cardiac intensive care unit before stage 2 palliation between 2014 and 2019. Using machine-learning techniques, we developed the Intensive care Warning Index (I-WIN), which systematically assessed 1028 regularly collected EHR variables (vital signs, medications, laboratory tests, and diagnoses) to identify patients in the cardiac intensive care unit at elevated risk of clinical deterioration. An ensemble of 5 extreme gradient boosting models was developed and validated on 203 cases (130 emergent endotracheal intubations, 34 cardiac arrests requiring cardiopulmonary resuscitation, 10 extracorporeal membrane oxygenation cannulations, and 29 cardiac arrests requiring cardiopulmonary resuscitation onto extracorporeal membrane oxygenation) and 378 control periods from 446 patients. Results: At 4 hours before deterioration, the model achieved an area under the receiver operating characteristic curve of 0.92 (95% confidence interval, 0.84-0.98), 0.881 sensitivity, 0.776 positive predictive value, 0.862 specificity, and 0.571 Brier skill score. Performance remained high at 8 hours before deterioration with 0.815 (0.688-0.921) area under the receiver operating characteristic curve. Conclusions: I-WIN accurately predicted deterioration events in critically-ill infants with high-risk congenital heart disease up to 8 hours before deterioration, potentially allowing clinicians to target interventions. We propose a paradigm shift from conventional expert consensus-based selection of risk factors to a data-driven, machine-learning methodology for risk prediction. With the increased availability of data capture in EHRs, I-WIN can be extended to broader applications in data-rich environments in critical care.
引用
收藏
页码:211 / +
页数:15
相关论文
共 50 条
  • [41] The Prediction of Flight Delay: Big Data-driven Machine Learning Approach
    Huo, Jiage
    Keung, K. L.
    Lee, C. K. M.
    Ng, Kam K. H.
    Li, K. C.
    2020 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM), 2020, : 190 - 194
  • [42] Machine Learning Models for Data-Driven Prediction of Diabetes by Lifestyle Type
    Qin, Yifan
    Wu, Jinlong
    Xiao, Wen
    Wang, Kun
    Huang, Anbing
    Liu, Bowen
    Yu, Jingxuan
    Li, Chuhao
    Yu, Fengyu
    Ren, Zhanbing
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (22)
  • [43] Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes
    Woldaregay, Ashenafi Zebene
    Arsand, Eirik
    Walderhaug, Stale
    Albers, David
    Mamykina, Lena
    Botsis, Taxiarchis
    Hartvigsen, Gunnar
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2019, 98 (109-134) : 109 - 134
  • [44] Data Leakage in Health Outcomes Prediction With Machine Learning. Comment on "Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning"
    Chiavegatto Filho, Alexandre
    De Moraes Batista, Andre Filipe
    dos Santos, Hellen Geremias
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (02)
  • [45] Treatment effect prediction with adversarial deep learning using electronic health records
    Chu, Jiebin
    Dong, Wei
    Wang, Jinliang
    He, Kunlun
    Huang, Zhengxing
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (Suppl 4)
  • [46] Readmission prediction using deep learning on electronic health records
    Ashfaq, Awais
    Sant'Anna, Anita
    Lingman, Markus
    Nowaczyk, Slawomir
    JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 97
  • [47] Data-Driven Load Forecasting Using Machine Learning and Meteorological Data
    Alrashidi A.
    Qamar A.M.
    Computer Systems Science and Engineering, 2023, 44 (03): : 1973 - 1988
  • [48] Data-driven approach to predict the fatigue properties of ferrous metal materials using the cGAN and machine-learning algorithms
    Li, Si-Geng
    Chen, Qiu-Ren
    Huang, Li
    Chen, Min
    Wei, Chen-Di
    Yue, Zhong-Jie
    Liu, Ru-Xue
    Tong, Chao
    Liu, Qing
    ADVANCES IN MANUFACTURING, 2024, 12 (03) : 447 - 464
  • [49] Data-driven strategy for contact angle prediction in underground hydrogen storage using machine learning
    Nassabeh, Mehdi
    You, Zhenjiang
    Keshavarz, Alireza
    Iglauer, Stefan
    JOURNAL OF ENERGY STORAGE, 2025, 114
  • [50] A data-driven approach for the prediction of coal seam gas content using machine learning techniques
    Akdas, Satuk Bugra
    Fisne, Abdullah
    APPLIED ENERGY, 2023, 347