Early prediction of clinical deterioration using data-driven machine-learning modeling of electronic health records

被引:15
|
作者
Ruiz, Victor M. [1 ]
Goldsmith, Michael P. [2 ,3 ]
Shi, Lingyun [1 ]
Simpao, Allan F. [2 ,3 ]
Galvez, Jorge A. [2 ,3 ]
Naim, Maryam Y. [2 ,3 ]
Nadkarni, Vinay [2 ,3 ]
Gaynor, J. William [2 ,3 ]
Tsui, Fuchiang [1 ,2 ,3 ]
机构
[1] Childrens Hosp Philadelphia, Dept Biomed & Hlth Informat, Tsui Lab, Philadelphia, PA 19146 USA
[2] Childrens Hosp Philadelphia, Dept Anesthesiol & Crit Care Med, Philadelphia, PA 19146 USA
[3] Univ Penn, Pereleman Sch Med, Philadelphia, PA 19104 USA
来源
基金
美国安德鲁·梅隆基金会; 美国国家卫生研究院;
关键词
machine learning; electronic health records; univentricular heart; extracorporeal membrane oxygenation; cardiopulmonary resuscitation; intubation; intratracheal; CARDIAC-ARREST; CHILDREN; NAMES; SCORE;
D O I
10.1016/j.jtcvs.2021.10.060
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objectives: To develop and evaluate a high-dimensional, data-driven model to identify patients at high risk of clinical deterioration from routinely collected electronic health record (EHR) data. Materials and Methods: In this single-center, retrospective cohort study, 488 patients with single-ventricle and shunt-dependent congenital heart disease <6 months old were admitted to the cardiac intensive care unit before stage 2 palliation between 2014 and 2019. Using machine-learning techniques, we developed the Intensive care Warning Index (I-WIN), which systematically assessed 1028 regularly collected EHR variables (vital signs, medications, laboratory tests, and diagnoses) to identify patients in the cardiac intensive care unit at elevated risk of clinical deterioration. An ensemble of 5 extreme gradient boosting models was developed and validated on 203 cases (130 emergent endotracheal intubations, 34 cardiac arrests requiring cardiopulmonary resuscitation, 10 extracorporeal membrane oxygenation cannulations, and 29 cardiac arrests requiring cardiopulmonary resuscitation onto extracorporeal membrane oxygenation) and 378 control periods from 446 patients. Results: At 4 hours before deterioration, the model achieved an area under the receiver operating characteristic curve of 0.92 (95% confidence interval, 0.84-0.98), 0.881 sensitivity, 0.776 positive predictive value, 0.862 specificity, and 0.571 Brier skill score. Performance remained high at 8 hours before deterioration with 0.815 (0.688-0.921) area under the receiver operating characteristic curve. Conclusions: I-WIN accurately predicted deterioration events in critically-ill infants with high-risk congenital heart disease up to 8 hours before deterioration, potentially allowing clinicians to target interventions. We propose a paradigm shift from conventional expert consensus-based selection of risk factors to a data-driven, machine-learning methodology for risk prediction. With the increased availability of data capture in EHRs, I-WIN can be extended to broader applications in data-rich environments in critical care.
引用
收藏
页码:211 / +
页数:15
相关论文
共 50 条
  • [21] Early Prediction of Acute Kidney Injury in the Emergency Department With Machine-Learning Methods Applied to Electronic Health Record Data
    Martinez, Diego A.
    Levin, Scott R.
    Klein, Eili Y.
    Parikh, Chirag R.
    Menez, Steven
    Taylor, Richard A.
    Hinson, Jeremiah S.
    ANNALS OF EMERGENCY MEDICINE, 2020, 76 (04) : 501 - 514
  • [22] Data-Driven Machine-Learning Model in District Heating System for Heat Load Prediction: A Comparison Study
    Dalipi, Fisnik
    Yayilgan, Sule Yildirim
    Gebremedhin, Alemayehu
    APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2016, 2016
  • [23] Machine Learning for Prediction in Electronic Health Data
    Rose, Sherri
    JAMA NETWORK OPEN, 2018, 1 (04)
  • [24] Data-Driven Prediction of Janus/Core-Shell Morphology in Polymer Particles: A Machine-Learning Approach
    Esteki, Bahareh
    Masoomi, Mahmood
    Moosazadeh, Mohammad
    Yoo, ChangKyoo
    LANGMUIR, 2023, 39 (14) : 4943 - 4958
  • [25] A data-driven energy performance gap prediction model using machine learning
    Yilmaz, Derya
    Tanyer, Ali Murat
    Toker, Irem Dikmen
    RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2023, 181
  • [26] Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry
    Gupta, Sunil
    Truyen Tran
    Luo, Wei
    Dinh Phung
    Kennedy, Richard Lee
    Broad, Adam
    Campbell, David
    Kipp, David
    Singh, Madhu
    Khasraw, Mustafa
    Matheson, Leigh
    Ashley, David M.
    Venkatesh, Svetha
    BMJ OPEN, 2014, 4 (03):
  • [27] A Framework for Modeling and Optimization of Data-Driven Energy Systems Using Machine Learning
    Danish M.S.S.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (05): : 2434 - 2443
  • [28] Data-driven surrogate modeling of multiphase flows using machine learning techniques
    Ganti, Himakar
    Khare, Prashant
    COMPUTERS & FLUIDS, 2020, 211
  • [29] A paradigm for data-driven predictive modeling using field inversion and machine learning
    Parish, Eric J.
    Duraisamy, Karthik
    JOURNAL OF COMPUTATIONAL PHYSICS, 2016, 305 : 758 - 774
  • [30] A data-driven approach using machine learning for early detection of the lean blowout
    Hasti, Veeraraghava Raju
    Navarkar, Abhishek
    Gore, Jay P.
    ENERGY AND AI, 2021, 5