Machine Learning-Based Hospital Discharge Prediction for Patients With Cardiovascular Diseases: Development and Usability Study

被引:10
作者
Ahn, Imjin [1 ]
Gwon, Hansle [1 ]
Kang, Heejun [2 ]
Kim, Yunha [1 ]
Seo, Hyeram [1 ]
Choi, Heejung [1 ]
Cho, Ha Na [2 ]
Kim, Minkyoung [2 ]
Jun, Tae Joon [3 ]
Kim, Young-Hak [2 ]
机构
[1] Univ Ulsan, Asan Med Ctr, Asan Med Inst Convergence Sci & Technol, Coll Med,Dept Med Sci, Seoul, South Korea
[2] Univ Ulsan, Asan Med Ctr, Dept Internal Med, Div Cardiol,Coll Med, 88,Olymp Ro 43 Gil, Seoul 05505, South Korea
[3] Asan Med Ctr, Big Data Res Ctr, Asan Inst Life Sci, Seoul, South Korea
关键词
electronic health records; cardiovascular diseases; discharge prediction; bed management; explainable artificial intelligence; LENGTH-OF-STAY; MODEL; MANAGEMENT; TIME;
D O I
10.2196/32662
中图分类号
R-058 [];
学科分类号
摘要
Background: Effective resource management in hospitals can improve the quality of medical services by reducing labor-intensive burdens on staff, decreasing inpatient waiting time, and securing the optimal treatment time. The use of hospital processes requires effective bed management; a stay in the hospital that is longer than the optimal treatment time hinders bed management. Therefore, predicting a patient's hospitalization period may support the making of judicious decisions regarding bed management. Objective: First, this study aims to develop a machine learning (ML)-based predictive model for predicting the discharge probability of inpatients with cardiovascular diseases (CVDs). Second, we aim to assess the outcome of the predictive model and explain the primary risk factors of inpatients for patient-specific care. Finally, we aim to evaluate whether our ML-based predictive model helps manage bed scheduling efficiently and detects long-term inpatients in advance to improve the use of hospital processes and enhance the quality of medical services. Methods: We set up the cohort criteria and extracted the data from CardioNet, a manually curated database that specializes in CVDs. We processed the data to create a suitable data set by reindexing the date-index, integrating the present features with past features from the previous 3 years, and imputing missing values. Subsequently, we trained the ML-based predictive models and evaluated them to find an elaborate model. Finally, we predicted the discharge probability within 3 days and explained the outcomes of the model by identifying, quantifying, and visualizing its features. Results: We experimented with 5 ML-based models using 5 cross-validations. Extreme gradient boosting, which was selected as the final model, accomplished an average area under the receiver operating characteristic curve score that was 0.865 higher than that of the other models (ie, logistic regression, random forest, support vector machine, and multilayer perceptron). Furthermore, we performed feature reduction, represented the feature importance, and assessed prediction outcomes. One of the outcomes, the individual explainer, provides a discharge score during hospitalization and a daily feature influence score to the medical team and patients. Finally, we visualized simulated bed management to use the outcomes. Conclusions: In this study, we propose an individual explainer based on an ML-based predictive model, which provides the discharge probability and relative contributions of individual features. Our model can assist medical teams and patients in identifying individual and common risk factors in CVDs and can support hospital administrators in improving the management of hospital beds and other resources.
引用
收藏
页数:17
相关论文
共 25 条
[1]   Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI) [J].
Adadi, Amina ;
Berrada, Mohammed .
IEEE ACCESS, 2018, 6 :52138-52160
[2]   CardioNet: a manually curated database for artificial intelligence-based research on cardiovascular diseases [J].
Ahn, Imjin ;
Na, Wonjun ;
Kwon, Osung ;
Yang, Dong Hyun ;
Park, Gyung-Min ;
Gwon, Hansle ;
Kang, Hee Jun ;
Jeong, Yeon Uk ;
Yoo, Jungsun ;
Kim, Yunha ;
Jun, Tae Joon ;
Kim, Young-Hak .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
[3]  
[Anonymous], 2021, CARDIOVASCULAR DIS C
[4]   Real-time prediction of inpatient length of stay for discharge prioritization [J].
Barnes, Sean ;
Hamrock, Eric ;
Toerper, Matthew ;
Siddiqui, Sauleh ;
Levin, Scott .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2016, 23 (E1) :E2-E10
[5]   An integer linear model for hospital bed planning [J].
Ben Bachouch, Rym ;
Guinet, Alain ;
Hajri-Gabouj, Sonia .
INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2012, 140 (02) :833-843
[6]  
Breiman L., 2001, Mach. Learn., V45, P5
[7]  
Chen T, 2016, KDD16 P 22 ACM, DOI DOI 10.1145/2939672.2939785
[8]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[9]   Logistic regression and artificial neural network classification models: a methodology review [J].
Dreiseitl, S ;
Ohno-Machado, L .
JOURNAL OF BIOMEDICAL INFORMATICS, 2002, 35 (5-6) :352-359
[10]   Gene selection for cancer classification using support vector machines [J].
Guyon, I ;
Weston, J ;
Barnhill, S ;
Vapnik, V .
MACHINE LEARNING, 2002, 46 (1-3) :389-422