Predicting 72-hour and 9-day return to the emergency department using machine learning

Cited by: 27
Authors
Hong, Woo Suk [1]
Haimovich, Adrian Daniel [2]
Taylor, Richard Andrew [2]
Affiliations
[1] Yale Sch Med, New Haven, CT 06519 USA
[2] Yale Sch Med, Dept Emergency Med, 464 Congress Ave,Ste 260, New Haven, CT 06519 USA
Keywords
decision support techniques; emergency medicine; machine learning; SOCIAL-WORK; VISITS; RISK; ADMISSION; OUTCOMES; DIAGNOSIS; ISSUES; TOOL
DOI
10.1093/jamiaopen/ooz019
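Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)]
Discipline classification code
Abstract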
Objectives: To predict 72-h and 9-day emergency department (ED) return by using gradient boosting on an expansive set of clinical variables from the electronic health record.
Methods: This retrospective study included all adult discharges from a level 1 trauma center ED and a community hospital ED covering the period of March 2013 to July 2017. A total of 1500 variables were extracted for each visit, and samples were split randomly into training, validation, and test sets (80%, 10%, and 10%). Gradient boosting models were fit on 3 selections of the data: administrative data (demographics, prior hospital usage, and comorbidity categories), data available at triage, and the full set of data available at discharge. A logistic regression (LR) model built on administrative data was used for baseline comparison. Finally, the top 20 most informative variables identified from the full gradient boosting models were used to build a reduced model for each outcome.
Results: A total of 330 631 discharges were available for analysis, with 29 058 discharges (8.8%) resulting in 72-h return and 52 748 discharges (16.0%) resulting in 9-day return to either ED. LR models using administrative data yielded test AUCs of 0.69 (95% confidence interval [CI] 0.68-0.70) and 0.71 (95% CI 0.70-0.72), while gradient boosting models using administrative data yielded test AUCs of 0.73 (95% CI 0.72-0.74) and 0.74 (95% CI 0.73-0.74) for 72-h and 9-day return, respectively. Gradient boosting models using variables available at triage yielded test AUCs of 0.75 (95% CI 0.74-0.76) and 0.75 (95% CI 0.74-0.75), while those using the full set of variables yielded test AUCs of 0.76 (95% CI 0.75-0.77) and 0.75 (95% CI 0.75-0.76). Reduced models using the top 20 variables yielded test AUCs of 0.73 (95% CI 0.71-0.74) and 0.73 (95% CI 0.72-0.74).
Discussion and Conclusion: Gradient boosting models leveraging clinical data are superior to LR models built on administrative data at predicting 72-h and 9-day returns.
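Illustrative note: the quantities reported in the abstract can be sanity-checked with a few lines of standard-library Python. The sketch below is not the authors' code. It recomputes the two return rates from the quoted counts, shows one possible way an 80/10/10 split of the discharges could be sized (the rounding rule is an assumption; the abstract does not give per-set counts), and computes an AUC as the fraction of positive/negative pairs that a score ranks correctly. The names `rate_percent`, `split_sizes`, and `auc` are hypothetical helpers introduced only for this illustration, and the gradient boosting and LR models themselves are not reproduced.

```python
# Counts quoted in the abstract.
N_DISCHARGES = 330_631
N_RETURN_72H = 29_058
N_RETURN_9D = 52_748


def rate_percent(events, total):
    """Return events/total as a percentage rounded to one decimal place."""
    return round(100 * events / total, 1)


def split_sizes(n):
    """Size an 80/10/10 train/validation/test split of n samples.

    Assumption: train and validation sizes are floored and the remainder
    goes to the test set. The abstract does not state the rounding used.
    """
    n_train = n * 8 // 10
    n_val = n // 10
    n_test = n - n_train - n_val
    return n_train, n_val, n_test


def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen positive case receives a
    higher score than a randomly chosen negative case, counting ties as 0.5.
    Quadratic in the number of samples, so suitable only for small examples.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = sum(
        1.0 if p > q else 0.5 if p == q else 0.0
        for p in pos
        for q in neg
    )
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    print("72-h return rate (%):", rate_percent(N_RETURN_72H, N_DISCHARGES))
    print("9-day return rate (%):", rate_percent(N_RETURN_9D, N_DISCHARGES))
    print("train/val/test sizes:", split_sizes(N_DISCHARGES))
    # Toy labels and scores, invented purely to exercise auc().
    print("toy AUC:", auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))
```

Read this way, a test AUC of 0.76 means that a discharge that led to a return received a higher predicted risk than a discharge that did not in roughly 76% of such pairs.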
Pages: 346-352
Number of pages: 7
Related papers
53 in total
[1]   THE PREVALENCE OF QUALITY ISSUES AND ADVERSE OUTCOMES AMONG 72-HOUR RETURN ADMISSIONS IN THE EMERGENCY DEPARTMENT [J].
Abualenain, Jameel ;
Frohna, William J. ;
Smith, Mark ;
Pipkin, Michael ;
Webb, Cynthia ;
Milzman, David ;
Pines, Jesse M. .
JOURNAL OF EMERGENCY MEDICINE, 2013, 45 (02) :281-287
[2] [Anonymous], 2017, 2017 SYST INF ENG
[3] Centers for Medicare and Medicaid Services, 2018, HOSP READM RED PROGR
[4]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[5] Collins GS, 2015, ANN INTERN MED, V162, P55, DOI [10.1016/j.jclinepi.2014.11.010, 10.1038/bjc.2014.639, 10.1136/bmj.g7594, 10.1016/j.eururo.2014.11.025, 10.7326/M14-0697, 10.1186/s12916-014-0241-z, 10.1002/bjs.9736, 10.7326/M14-0698]
[6]   THE RISK OF DETERMINING RISK WITH MULTIVARIABLE MODELS [J].
CONCATO, J ;
FEINSTEIN, AR ;
HOLFORD, TR .
ANNALS OF INTERNAL MEDICINE, 1993, 118 (03) :201-210
[7] Corbett Helen M, 2005, Aust Health Rev, V29, P43
[8]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845
[9]   Development and Application of a Machine Learning Approach to Assess Short-term Mortality Risk Among Patients With Cancer Starting Chemotherapy [J].
Elfiky, Aymen A. ;
Pany, Maximilian J. ;
Parikh, Ravi B. ;
Obermeyer, Ziad .
JAMA NETWORK OPEN, 2018, 1 (03) :e180926
[10]   Using the Electronic Medical Record to Identify Patients at High Risk for Frequent Emergency Department Visits and High System Costs [J].
Frost, David W. ;
Vembu, Shankar ;
Wang, Jiayi ;
Tu, Karen ;
Morris, Quaid ;
Abrams, Howard B. .
AMERICAN JOURNAL OF MEDICINE, 2017, 130 (05) :601.e17-601.e22