Multitask Learning With Recurrent Neural Networks for Acute Respiratory Distress Syndrome Prediction Using Only Electronic Health Record Data: Model Development and Validation Study

被引:8
|
作者
Lam, Carson [1 ]
Thapa, Rahul [1 ]
Maharjan, Jenish [1 ]
Rahmani, Keyvan [1 ]
Tso, Chak Foon [1 ]
Singh, Navan Preet [1 ]
Chetty, Satish Casie [1 ]
Mao, Qingqing [1 ]
机构
[1] Dascena Inc, 12333 Sowden Rd,Suite B, Houston, TX 77080 USA
关键词
deep learning; neural networks; ARDS; health care; multitask learning; clinical decision support; prediction model; COVID-19; electronic health record; risk outcome; respiratory distress; diagnostic criteria; recurrent neural network; EARLY IDENTIFICATION; VENTILATION; CRITERIA; SCORE; RISK;
D O I
10.2196/36202
中图分类号
R-058 [];
学科分类号
摘要
Background: Acute respiratory distress syndrome (ARDS) is a condition that is often considered to have broad and subjective diagnostic criteria and is associated with significant mortality and morbidity. Early and accurate prediction of ARDS and related conditions such as hypoxemia and sepsis could allow timely administration of therapies, leading to improved patient outcomes. Objective: The aim of this study is to perform an exploration of how multilabel classification in the clinical setting can take advantage of the underlying dependencies between ARDS and related conditions to improve early prediction of ARDS in patients. Methods: The electronic health record data set included 40,703 patient encounters from 7 hospitals from April 20, 2018, to March 17, 2021. A recurrent neural network (RNN) was trained using data from 5 hospitals, and external validation was conducted on data from 2 hospitals. In addition to ARDS, 12 target labels for related conditions such as sepsis, hypoxemia, and COVID-19 were used to train the model to classify a total of 13 outputs. As a comparator, XGBoost models were developed for each of the 13 target labels. Model performance was assessed using the area under the receiver operating characteristic curve. Heat maps to visualize attention scores were generated to provide interpretability to the neural networks. Finally, cluster analysis was performed to identify potential phenotypic subgroups of patients with ARDS. Results: The single RNN model trained to classify 13 outputs outperformed the individual XGBoost models for ARDS prediction, achieving an area under the receiver operating characteristic curve of 0.842 on the external test sets. Models trained on an increasing number of tasks resulted in improved performance. Earlier prediction of ARDS nearly doubled the rate of in-hospital survival. Cluster analysis revealed distinct ARDS subgroups, some of which had similar mortality rates but different clinical presentations. Conclusions: The RNN model presented in this paper can be used as an early warning system to stratify patients who are at risk of developing one of the multiple risk outcomes, hence providing practitioners with the means to take early action.
引用
收藏
页数:19
相关论文
共 37 条
  • [31] Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED)
    Grout, Randall W.
    Hui, Siu L.
    Imler, Timothy D.
    El-Azab, Sarah
    Baker, Jarod
    Sands, George H.
    Ateya, Mohammad
    Pike, Francis
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
  • [32] Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED)
    Randall W. Grout
    Siu L. Hui
    Timothy D. Imler
    Sarah El-Azab
    Jarod Baker
    George H. Sands
    Mohammad Ateya
    Francis Pike
    BMC Medical Informatics and Decision Making, 21
  • [33] Development and Validation of Machine Learning Models for Prediction of 1-Year Mortality Utilizing Electronic Medical Record Data Available at the End of Hospitalization in Multicondition Patients: a Proof-of-Concept Study
    Sahni, Nishant
    Simon, Gyorgy
    Arora, Rashi
    JOURNAL OF GENERAL INTERNAL MEDICINE, 2018, 33 (06) : 921 - 928
  • [34] Development and validation of a deep learning-based survival prediction model for pediatric glioma patients: A retrospective study using the SEER database and Chinese data
    Jiao, Yang
    Ye, Jianan
    Zhao, Wenjian
    Fan, Zhicheng
    Kou, Yunpeng
    Guo, Shaochun
    Chao, Min
    Fan, Chao
    Ji, Peigang
    Liu, Jinghui
    Zhai, Yulong
    Wang, Yuan
    Wang, Na
    Wang, Liang
    Computers in Biology and Medicine, 2024, 182
  • [35] Development and validation of a prediction model to estimate the risk of liver cirrhosis in primary care patients with abnormal liver blood test results: protocol for an electronic health record study in Clinical Practice Research Datalink
    Suvi Härmälä
    Alastair O’Brien
    Constantinos A. Parisinos
    Kenan Direk
    Laura Shallcross
    Andrew Hayward
    Diagnostic and Prognostic Research, 3 (1)
  • [36] Deep learning using computed tomography to identify high-risk patients for acute small bowel obstruction: development and validation of a prediction model : a retrospective cohort study
    Oh, Seungmin
    Ryu, Jongbin
    Shin, Ho-Jung
    Song, Jeong Ho
    Son, Sang-Yong
    Hur, Hoon
    Han, Sang-Uk
    INTERNATIONAL JOURNAL OF SURGERY, 2023, 109 (12) : 4091 - 4100
  • [37] Development and validation of a deep learning-based model to predict response and survival of T790M mutant non-small cell lung cancer patients in early clinical phase trials using electronic medical record and pharmacokinetic data
    Lou, Ning
    Gao, Ruyun
    Xu, Chi
    Qiao, Nan
    Jiang, Ji
    Wang, Lu
    Wang, Weicong
    Wang, Shanbo
    Shen, Wei
    Zheng, Xin
    Han, Xiaohong
    TRANSLATIONAL LUNG CANCER RESEARCH, 2024, 13 (04) : 706 - 720