Using random forests to model 90-day hometime in people with stroke

被引:15
作者
Holodinsky, Jessalyn K. [1 ]
Yu, Amy Y. X. [2 ,3 ]
Kapral, Moira K. [2 ,4 ,5 ,6 ]
Austin, Peter C. [2 ,6 ,7 ]
机构
[1] Univ Calgary, Cumming Sch Med, Dept Clin Neurosci, 3330 Hosp Dr NW, Calgary, AB T2N 4N1, Canada
[2] ICES, Toronto, ON, Canada
[3] Univ Toronto, Dept Med Neurol, Sunnybrook Hlth Sci Ctr, Toronto, ON, Canada
[4] Univ Toronto, Dept Med Gen Internal Med, Toronto, ON, Canada
[5] Univ Hlth Network, Toronto, ON, Canada
[6] Univ Toronto, Inst Hlth Policy Management & Evaluat, Toronto, ON, Canada
[7] Sunnybrook Res Inst, Schulich Heart Res Program, Toronto, ON, Canada
基金
加拿大健康研究院;
关键词
Stroke; Hometime; Random forests; ACUTE ISCHEMIC-STROKE; ADMINISTRATIVE DATA; UNCERTAINTY LIFE; TIME; PREVALENCE; INDEX;
D O I
10.1186/s12874-021-01289-8
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background Ninety-day hometime, the number of days a patient is living in the community in the first 90 after stroke, exhibits a non-normal bucket-shaped distribution, with lower and upper constraints making its analysis difficult. In this proof-of-concept study we evaluated the performance of random forests regression in the analysis of hometime. Methods Using administrative data we identified stroke hospitalizations between 2010 and 2017 in Ontario, Canada. We used random forests regression to predict 90-day hometime using 15 covariates. Model accuracy was determined using the r-squared statistic. Variable importance in prediction and the marginal effects of each covariate were explored. Results We identified 75,745 eligible patients. Median 90-day hometime was 59 days (Q1: 2, Q3: 83). Random forests predicted hometime with reasonable accuracy (adjusted r-squared 0.3462); no implausible values were predicted but extreme values were predicted with low accuracy. Frailty, stroke severity, and age exhibited inverse non-linear relationships with hometime and patients arriving by ambulance had less hometime than those who did not. Conclusions Random forests may be a useful method for analyzing 90-day hometime and capturing the complex non-linear relationships which exist between predictors and hometime. Future work should compare random forests to other models and focus on improving the accuracy of predictions of extreme values of hometime.
引用
收藏
页数:12
相关论文
共 30 条
[1]   A multicenter study of the coding accuracy of hospital discharge administrative data for patients admitted to cardiac care units in Ontario [J].
Austin, PC ;
Daly, PA ;
Tu, JV .
AMERICAN HEART JOURNAL, 2002, 144 (02) :290-296
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]   Endovascular Therapy for Ischemic Stroke with Perfusion-Imaging Selection [J].
Campbell, B. C. V. ;
Mitchell, P. J. ;
Kleinig, T. J. ;
Dewey, H. M. ;
Churilov, L. ;
Yassi, N. ;
Yan, B. ;
Dowling, R. J. ;
Parsons, M. W. ;
Oxley, T. J. ;
Wu, T. Y. ;
Brooks, M. ;
Simpson, M. A. ;
Miteff, F. ;
Levi, C. R. ;
Krause, M. ;
Harrington, T. J. ;
Faulder, K. C. ;
Steinfort, B. S. ;
Priglinger, M. ;
Ang, T. ;
Scroop, R. ;
Barber, P. A. ;
McGuinness, B. ;
Wijeratne, T. ;
Phan, T. G. ;
Chong, W. ;
Chandra, R. V. ;
Bladin, C. F. ;
Badve, M. ;
Rice, H. ;
de Villiers, L. ;
Ma, H. ;
Desmond, P. M. ;
Donnan, G. A. ;
Davis, S. M. .
NEW ENGLAND JOURNAL OF MEDICINE, 2015, 372 (11) :1009-1018
[4]   Outcome measures in acute stroke trials - A systematic review and some recommendations to improve practice [J].
Duncan, PW ;
Jorgensen, HS ;
Wade, DT .
STROKE, 2000, 31 (06) :1429-1438
[5]   Charlson comorbidity index as a predictor of in-hospital death in acute ischemic stroke among very old patients: a single-cohort perspective study [J].
Falsetti, Lorenzo ;
Viticchi, Giovanna ;
Tarquinio, Nicola ;
Silvestrini, Mauro ;
Capeci, William ;
Catozzo, Vania ;
Fioranelli, Agnese ;
Buratti, Laura ;
Pellegrini, Francesco .
NEUROLOGICAL SCIENCES, 2016, 37 (09) :1443-1448
[6]   Assessment of Home-Time After Acute Ischemic Stroke in Medicare Beneficiaries [J].
Fonarow, Gregg C. ;
Liang, Li ;
Thomas, Laine ;
Xian, Ying ;
Saver, Jeffrey L. ;
Smith, Eric E. ;
Schwamm, Lee H. ;
Peterson, Eric D. ;
Hernandez, Adrian F. ;
Duncan, Pamela W. ;
O'Brien, Emily C. ;
Bushnell, Cheryl ;
Bettger, Janet Prvu .
STROKE, 2016, 47 (03) :836-842
[7]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232
[8]   Development and validation of a Hospital Frailty Risk Score focusing on older people in acute care settings using electronic hospital records: an observational study [J].
Gilbert, Thomas ;
Neuburger, Jenny ;
Kraindler, Joshua ;
Keeble, Eilis ;
Smith, Paul ;
Ariti, Cono ;
Arora, Sandeepa ;
Street, Andrew ;
Parker, Stuart ;
Roberts, Helen C. ;
Bardsley, Martin ;
Conroy, Simon .
LANCET, 2018, 391 (10132) :1775-1782
[9]   Charlson Index comorbidity adjustment for ischemic stroke outcome studies [J].
Goldstein, LB ;
Samsa, GP ;
Matchar, DB ;
Horner, RD .
STROKE, 2004, 35 (08) :1941-1945
[10]   Certain Uncertainty Life After Stroke From the Patient's Perspective [J].
Hannah, Deidre ;
Lindholm, Brianna ;
Maisch, Lesley .
CIRCULATION-CARDIOVASCULAR QUALITY AND OUTCOMES, 2014, 7 (06) :968-969