Hard for humans, hard for machines: predicting readmission after psychiatric hospitalization using narrative notes

Cited by: 18
Authors
Boag, William [1]
Kovaleva, Olga [2]
McCoy, Thomas H., Jr. [3]
Rumshisky, Anna [2]
Szolovits, Peter [1]
Perlis, Roy H. [3]
Affiliations
[1] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Univ Massachusetts Lowell, Dept Comp Sci, Lowell, MA 01854 USA
[3] Massachusetts Gen Hosp, Div Clin Res, Ctr Quantitat Hlth, 185 Cambridge St, Boston, MA 02114 USA
Funding
National Science Foundation (USA);
Keywords
SUICIDE; DEATH;
DOI
10.1038/s41398-020-01104-w
Chinese Library Classification number
R749 [Psychiatry];
Discipline classification code
100205;
Abstract
Machine learning has been suggested as a means of identifying individuals at greatest risk for hospital readmission, including psychiatric readmission. We sought to compare the performance of predictive models that use interpretable representations derived via topic modeling to the performance of human experts and nonexperts. We examined all 5076 admissions to a general psychiatry inpatient unit between 2009 and 2016 using electronic health records. We developed multiple models to predict 180-day readmission for these admissions based on features derived from narrative discharge summaries, augmented by baseline sociodemographic and clinical features. We developed models using a training set comprising 70% of the cohort and evaluated on the remaining 30%. Baseline models using demographic features for prediction achieved an area under the curve (AUC) of 0.675 [95% CI 0.674-0.676] on an independent testing set, while language-based models also incorporating bag-of-words features, discharge summary topics identified by latent Dirichlet allocation (LDA), and prior psychiatric admissions achieved an AUC of 0.726 [95% CI 0.725-0.727]. To characterize the difficulty of the task, we also compared the performance of these classifiers to both expert and nonexpert human raters, with and without feedback, on a subset of 75 test cases. These models outperformed humans on average, including predictions by experienced psychiatrists. Typical note tokens or topics associated with readmission risk were related to pregnancy/postpartum state, family relationships, and psychosis.
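The evaluation described in the abstract (a 70/30 split and an AUC reported with a 95% confidence interval) can be illustrated with a minimal sketch. This is not the authors' code: the abstract does not say how the split was drawn or how the interval was computed, so the random shuffle, the percentile bootstrap, and the function names `split_70_30`, `auc`, and `bootstrap_ci` are all assumptions made here for illustration.

```python
import random


def split_70_30(items, seed=0):
    """Shuffle and split into 70% train / 30% test (assumed random split)."""
    rng = random.Random(seed)
    shuffled = list(items)
    rng.shuffle(shuffled)
    cut = (7 * len(shuffled)) // 10
    return shuffled[:cut], shuffled[cut:]


def auc(labels, scores):
    """Area under the ROC curve via the pairwise (Mann-Whitney) identity."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    # A positive/negative pair counts 1 if the positive case scores higher,
    # 0.5 if the two scores tie, and 0 otherwise.
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_ci(labels, scores, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the AUC (an assumed CI method)."""
    auc(labels, scores)  # fail early if only one class is present
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if 0 < sum(ys) < n:  # skip resamples that contain a single class
            stats.append(auc(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper
```

Here `labels` would hold 1 for admissions followed by readmission within 180 days and 0 otherwise, and `scores` would hold a model's predicted risks on the held-out 30%. The pairwise AUC is quadratic in the number of cases, which is acceptable for a sketch but slow for repeated resampling on a large test set.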
Pages: 6