Machine Learning-Enabled 30-Day Readmission Model for Stroke Patients

被引:22
作者
Darabi, Negar [1 ]
Hosseinichimeh, Niyousha [1 ]
Noto, Anthony [2 ]
Zand, Ramin [2 ]
Abedi, Vida [3 ,4 ]
机构
[1] Virginia Tech, Dept Ind & Syst Engn, Falls Church, VA 22043 USA
[2] Geisinger Hlth Syst, Geisinger Neurosci Inst, Danville, PA USA
[3] Geisinger Hlth Syst, Dept Mol & Funct Genom, Danville, PA 17822 USA
[4] Virginia Tech, Biocomplex Inst, Blacksburg, VA 24061 USA
来源
FRONTIERS IN NEUROLOGY | 2021年 / 12卷
关键词
ischemic stroke; 30-day readmissions; machine learning; statistical analysis; patient readmission;
D O I
10.3389/fneur.2021.638267
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
Background and Purpose: Hospital readmissions impose a substantial burden on the healthcare system. Reducing readmissions after stroke could lead to improved quality of care especially since stroke is associated with a high rate of readmission. The goal of this study is to enhance our understanding of the predictors of 30-day readmission after ischemic stroke and develop models to identify high-risk individuals for targeted interventions. Methods: We used patient-level data from electronic health records (EHR), five machine learning algorithms (random forest, gradient boosting machine, extreme gradient boosting-XGBoost, support vector machine, and logistic regression-LR), data-driven feature selection strategy, and adaptive sampling to develop 15 models of 30-day readmission after ischemic stroke. We further identified important clinical variables. Results: We included 3,184 patients with ischemic stroke (mean age: 71 +/- 13.90 years, men: 51.06%). Among the 61 clinical variables included in the model, the National Institutes of Health Stroke Scale score above 24, insert indwelling urinary catheter, hypercoagulable state, and percutaneous gastrostomy had the highest importance score. The Model's AUC (area under the curve) for predicting 30-day readmission was 0.74 (95%CI: 0.64-0.78) with PPV of 0.43 when the XGBoost algorithm was used with ROSE-sampling. The balance between specificity and sensitivity improved through the sampling strategy. The best sensitivity was achieved with LR when optimized with feature selection and ROSE-sampling (AUC: 0.64, sensitivity: 0.53, specificity: 0.69). Conclusions: Machine learning-based models can be designed to predict 30-day readmission after stroke using structured data from EHR. Among the algorithms analyzed, XGBoost with ROSE-sampling had the best performance in terms of AUC while LR with ROSE-sampling and feature selection had the best sensitivity. Clinical variables highly associated with 30-day readmission could be targeted for personalized interventions. Depending on healthcare systems' resources and criteria, models with optimized performance metrics can be implemented to improve outcomes.
引用
收藏
页数:10
相关论文
共 36 条
[1]   Increasing the Density of Laboratory Measures for Machine Learning Applications [J].
Abedi, Vida ;
Li, Jiang ;
Shivakumar, Manu K. ;
Avula, Venkatesh ;
Chaudhary, Durgesh P. ;
Shellenberger, Matthew J. ;
Khara, Harshit S. ;
Zhang, Yanfei ;
Lee, Ming Ta Michael ;
Wolk, Donna M. ;
Yeasin, Mohammed ;
Hontecillas, Raquel ;
Bassaganya-Riera, Josep ;
Zand, Ramin .
JOURNAL OF CLINICAL MEDICINE, 2021, 10 (01) :1-23
[2]   Sensitivity Analysis of an ENteric Immunity SImulator (ENISI)-Based Model of Immune Responses to Helicobacter pylori Infection [J].
Alam, Maksudul ;
Deng, Xinwei ;
Philipson, Casandra ;
Bassaganya-Riera, Josep ;
Bisset, Keith ;
Carbo, Adria ;
Eubank, Stephen ;
Hontecillas, Raquel ;
Hoops, Stefan ;
Mei, Yongguo ;
Abedi, Vida ;
Marathe, Madhav .
PLOS ONE, 2015, 10 (09)
[3]   Estimates and Temporal Trend for US Nationwide 30-Day Hospital Readmission Among Patients With Ischemic and Hemorrhagic Stroke [J].
Bambhroliya, Arvind B. ;
Donnelly, John P. ;
Thomas, Eric J. ;
Tyson, Jon E. ;
Miller, Charles C. ;
McCullough, Louise D. ;
Savitz, Sean, I ;
Vahidy, Farhaan S. .
JAMA NETWORK OPEN, 2018, 1 (04)
[4]   Influence of hospital-level practices on readmission after ischemic stroke [J].
Burke, James F. ;
Skolarus, Lesli E. ;
Adelman, Eric E. ;
Reeves, Mathew J. ;
Brown, Devin L. .
NEUROLOGY, 2014, 82 (24) :2196-2204
[5]   Trends in ischemic stroke outcomes in a rural population in the United States [J].
Chaudhary, Durgesh ;
Khan, Ayesha ;
Shahjouei, Shima ;
Gupta, Mudit ;
Lambert, Clare ;
Avula, Venkatesh ;
Schirmer, Clemens M. ;
Holland, Neil ;
Griessenauer, Christoph J. ;
Azarpazhooh, M. Reza ;
Li, Jiang ;
Abedi, Vida ;
Zand, Ramin .
JOURNAL OF THE NEUROLOGICAL SCIENCES, 2021, 422
[6]   Multi-Resolution Sensitivity Analysis of Model of Immune Response to Helicobacter pylori Infection via Spatio-Temporal Metamodeling [J].
Chen, Xi ;
Wang, Wenjing ;
Xie, Guangrui ;
Hontecillas, Raquel ;
Verma, Meghna ;
Leber, Andrew ;
Bassaganya-Riera, Josep ;
Abedi, Vida .
FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2019, 5
[7]  
Chuang Kun-Yang, 2005, J Nurs Res, V13, P117
[8]   Incorporating Artificial Intelligence Into Stroke Care and Research [J].
Ding, Lingling ;
Liu, Chelsea ;
Li, Zixiao ;
Wang, Yongjun .
STROKE, 2020, 51 (12) :E351-E354
[9]  
Francisco A, 2018, CIRCULATION, V138
[10]   Prediction of 30-Day All-Cause Readmissions in Patients Hospitalized for Heart Failure Comparison of Machine Learning and Other Statistical Approaches [J].
Frizzell, Jarrod D. ;
Liang, Li ;
Schulte, Phillip J. ;
Yancy, Clyde W. ;
Heidenreich, Paul A. ;
Hernandez, Adrian F. ;
Bhatt, Deepak L. ;
Fonarow, Gregg C. ;
Laskey, Warren K. .
JAMA CARDIOLOGY, 2017, 2 (02) :204-209