A deep attention model to forecast the Length Of Stay and the in-hospital mortality right on admission from ICD codes and demographic data

被引：26

作者：

Harerimana, Gaspard ^{[1
]}

Kim, Jong Wook ^{[1
]}

Jang, Beakcheol ^{[2
]}

机构：

[1] Sangmyung Univ, Dept Comp Sci, Seoul, South Korea

[2] Yonsei Univ, Grad Sch Informat, Seoul, South Korea

来源：

JOURNAL OF BIOMEDICAL INFORMATICS | 2021年 / 118卷

基金：

新加坡国家研究基金会;

关键词：

Boosting; Class imbalance; Length of stay; Electronic health record; Hierarchical attention network;

D O I：

10.1016/j.jbi.2021.103778

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Leveraging the Electronic Health Records (EHR) longitudinal data to produce actionable clinical insights has always been a critical issue for recent studies. Non-forecasted extended hospitalizations account for a disproportionate amount of resource use, the mediocre quality of inpatient care, and avoidable fatalities. The capability to predict the Length of Stay (LoS) and mortality in the early stages of the admission provides opportunities to improve care and prevent many preventable losses. Forecasting the in-hospital mortality is important in providing clinicians with enough insights to make decisions and hospitals to allocate resources, hence predicting the LoS and mortality within the first day of admission is a difficult but a paramount endeavor. The biggest challenge is that few data are available by this time, thus the prediction has to bring in the previous admissions history and free text diagnosis that are recorded immediately on admission. We propose a model that uses the multi-modal EHR structured medical codes and key demographic information to classify the LoS in 3 classes; Short Los (LoS <= 10 days), Medium LoS (10 LoS <= 30 days) and Long LoS (LoS 30 days) as well as mortality as a binary classification of a patient's death during current admission. The prediction has to use data available only within 24 h of admission. The key predictors include previous ICD9 diagnosis codes, ICD9 procedures, key demographic data, and free text diagnosis of the current admission recorded right on admission. We propose a Hierarchical Attention Network (HAN-LoS and HAN-Mor) model and train it to a dataset of over 45321 admissions recorded in the de-identified MIMIC-III dataset. For improved prediction, our attention mechanisms can focus on the most influential past admissions and most influential codes in these admissions. For fair performance evaluation, we implemented and compared the HAN model with previous approaches. With dataset balancing techniques HAN-LoS achieved an AUROC of over 0.82 and a Micro-F1 score of 0.24 and HAN-Mor achieved AUCROC of 0.87 hence outperforming the existing baselines that use structured medical codes as well as clinical time series for LoS and Mortality forecasting. By predicting mortality and LoS using the same model, we show that with little tuning the proposed model can be used for other clinical predictive tasks like phenotyping, decompensation,re-admission prediction, and survival analysis.

引用

页数：11

共 59 条

[51]

Trask A., 2015, ARXIV PREPRINT ARXIV

[52] Length of Hospital Stay Prediction at the Admission Stage for Cardiology Patients Using Artificial Neural Network [J].

Tsai, Pei-Fang ;

Chen, Po-Chia ;

Chen, Yen-You ;

Song, Hao-Yuan ;

Lin, Hsiu-Mei ;

Lin, Fu-Man ;

Huang, Qiou-Pieng .

JOURNAL OF HEALTHCARE ENGINEERING, 2016, 2016

[53]

Vaswani A, 2017, ADV NEUR IN, V30

[54]

Wang B.X., 2004, P IRIS MACH LEARN WO, V19

[55] Multiclass Imbalance Problems: Analysis and Potential Solutions [J].

Wang, Shuo ;

Yao, Xin .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (04) :1119-1130

[56]

Yang Zichao, 2016, NAACL 2016, P1480, DOI DOI 10.18653/V1/N16-1174

[57]

Zebin T., 2019, 2019 IEEE C COMP INT, P1

[58]

Zhang Y., 2019, P 28 INT JOINT C ART, P10

[59] Synthetic minority oversampling technique for multiclass imbalance problems [J].

Zhu, Tuanfei ;

Lin, Yaping ;

Liu, Yonghe .

PATTERN RECOGNITION, 2017, 72 :327-340

← 1 2 3 4 5 6 →