Using the Electronic Medical Record to Identify Patients at High Risk for Frequent Emergency Department Visits and High System Costs

被引:36
作者
Frost, David W. [1 ,4 ,5 ,11 ]
Vembu, Shankar [6 ,11 ]
Wang, Jiayi [6 ,11 ]
Tu, Karen [2 ,3 ,4 ,11 ,12 ]
Morris, Quaid [6 ,7 ,8 ,9 ,10 ,11 ]
Abrams, Howard B. [1 ,4 ,5 ,11 ]
机构
[1] Univ Toronto, Div Gen Internal Med, Toronto, ON, Canada
[2] Univ Toronto, Dept Family & Community Med, Toronto, ON, Canada
[3] Univ Toronto, Inst Hlth Policy Management & Evaluat, Toronto, ON, Canada
[4] Univ Hlth Network, Toronto, ON, Canada
[5] Univ Hlth Network, OpenLab, Toronto, ON, Canada
[6] Donnelly Ctr Cellular & Biomol Res, Toronto, ON, Canada
[7] Banting & Best Dept Med Res, Toronto, ON, Canada
[8] Univ Toronto, Dept Med Genet, Toronto, ON, Canada
[9] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
[10] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
[11] Univ Toronto, Toronto, ON, Canada
[12] Inst Clin Evaluat Sci, Toronto, ON, Canada
关键词
Electronic medical records; Frequent emergency department visits; High users; Machine learning; Predictive modeling; HEALTH-CARE; READMISSION; USERS; VALIDATION; ADMISSIONS; PREDICT; DEATH; MODEL; PEOPLE; TRIAL;
D O I
10.1016/j.amjmed.2016.12.008
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
BACKGROUND: A small proportion of patients account for a high proportion of healthcare use. Accurate preemptive identification may facilitate tailored intervention. We sought to determine whether machine learning techniques using text from a family practice electronic medical record can be used to predict future high emergency department use and total costs by patients who are not yet high emergency department users or high cost to the healthcare system. METHODS: Text from fields of the cumulative patient profile within an electronic medical record of 43,111 patients was indexed. Separate training and validation cohorts were created. After processing, 11,905 words were used to fit a logistic regression model. The primary outcomes of interest in the 12 months after prediction were 3 or more emergency department visits and being in the top 5% in healthcare expenditures. Outcomes were assessed through linkage to administrative data bases housed at the Institute for Clinical Evaluative Sciences. RESULTS: In the model to predict frequent emergency department visits, after excluding patients who were high emergency department users in the previous year, the area under the receiver operating characteristic curve was 0.71. By using the same methodology, the model to predict the top 5% in total system costs had an area under the receiver operating characteristic curve of 0.76. CONCLUSIONS: Machine learning techniques can be applied to analyze free text contained in electronic medical records. This dataset is more predictive of patients who will generate future high costs than future emergency department visits. It remains to be seen whether these predictions can be used to reduce costs by early interventions in this cohort of patients. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:601.e17 / 601.e22
页数:6
相关论文
共 31 条
[1]   Patients in Context - EHR Capture of Social and Behavioral Determinants of Health [J].
Adler, Nancy E. ;
Stead, William W. .
NEW ENGLAND JOURNAL OF MEDICINE, 2015, 372 (08) :698-701
[2]   Electronic medical record-based multicondition models to predict the risk of 30 day readmission or death among adult medicine patients: validation and comparison to existing models [J].
Amarasingham, Ruben ;
Velasco, Ferdinand ;
Xie, Bin ;
Clark, Christopher ;
Ma, Ying ;
Zhang, Song ;
Bhat, Deepa ;
Lucena, Brian ;
Huesch, Marco ;
Halm, Ethan A. .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2015, 15
[3]   Case finding for patients at risk of readmission to hospital: development of algorithm to identify high risk patients [J].
Billings, John ;
Dixon, Jennifer ;
Mijanovich, Tod ;
Wennberg, David .
BRITISH MEDICAL JOURNAL, 2006, 333 (7563) :327-330
[4]   Identifying patients at high risk of emergency hospital admissions: a logistic regression analysis [J].
Bottle, Alex ;
Aylin, Paul ;
Majeed, Azeem .
JOURNAL OF THE ROYAL SOCIETY OF MEDICINE, 2006, 99 (08) :406-414
[5]   Using Electronic Health Record Data to Develop and Validate a Prediction Model for Adverse Outcomes in the Wards [J].
Churpek, Matthew M. ;
Yuen, Trevor C. ;
Park, Seo Young ;
Gibbons, Robert ;
Edelson, Dana P. .
CRITICAL CARE MEDICINE, 2014, 42 (04) :841-848
[6]   Effect of a Postdischarge Virtual Ward on Readmission or Death for High-Risk Patients A Randomized Clinical Trial [J].
Dhalla, Irfan A. ;
O'Brien, Tara ;
Morra, Dante ;
Thorpe, Kevin E. ;
Wong, Brian M. ;
Mehta, Rajin ;
Frost, David W. ;
Abrams, Howard ;
Ko, Francoise ;
Van Rooyen, Patrick ;
Bell, Chaim M. ;
Gruneir, Andrea ;
Lewis, Geraint H. ;
Daub, Stacey ;
Anderson, Geoff M. ;
Hawker, Gillian A. ;
Rochon, Paula A. ;
Laupacis, Andreas .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2014, 312 (13) :1305-1312
[7]   Development and validation of a model for predicting emergency admissions over the next year (PEONY) - A UK historical cohort study [J].
Donnan, Peter T. ;
Dorward, David W. T. ;
Mutch, Bill ;
Morris, Andrew D. .
ARCHIVES OF INTERNAL MEDICINE, 2008, 168 (13) :1416-1422
[8]   Looking Beyond Income and Education Socioeconomic Status Gradients Among Future High-Cost Users of Health Care [J].
Fitzpatrick, Tiffany ;
Rosella, Laura C. ;
Calzavara, Andrew ;
Petch, Jeremy ;
Pinto, Andrew D. ;
Manson, Heather ;
Goel, Vivek ;
Wodchis, Walter P. .
AMERICAN JOURNAL OF PREVENTIVE MEDICINE, 2015, 49 (02) :161-171
[9]   Using a mailed survey to predict hospital admission among patients older than 80 [J].
Freedman, JD ;
Beck, A ;
Robertson, B ;
Calonge, BN ;
Gade, G .
JOURNAL OF THE AMERICAN GERIATRICS SOCIETY, 1996, 44 (06) :689-692
[10]  
Georghiu T, 2011, PREDICTIVE RISK HEAL