Predicting early psychiatric readmission with natural language processing of narrative discharge summaries

被引:117
作者
Rumshisky, A. [1 ,2 ]
Ghassemi, M. [1 ]
Naumann, T. [1 ]
Szolovits, P. [1 ]
Castro, V. M. [3 ,4 ,5 ,6 ]
Mccoy, T. H. [3 ,4 ,5 ]
Perlis, R. H. [3 ,4 ,5 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Univ Massachusetts Lowell, Dept Comp Sci, Lowell, MA USA
[3] Massachusetts Gen Hosp, Ctr Expt Drugs & Diagnost, Boston, MA 02114 USA
[4] Massachusetts Gen Hosp, Dept Psychiat, Simches Res Bldg,185 Cambridge St,6th Floor, Boston, MA 02114 USA
[5] Massachusetts Gen Hosp, Ctr Human Genet Res, Boston, MA 02114 USA
[6] Partners HealthCare Syst, Partners Res Informat Syst & Comp, Boston, MA USA
来源
TRANSLATIONAL PSYCHIATRY | 2016年 / 6卷
关键词
ELECTRONIC MEDICAL-RECORDS; IDENTIFICATION; DEPRESSION;
D O I
10.1038/tp.2015.182
中图分类号
R749 [精神病学];
学科分类号
100205 ;
摘要
The ability to predict psychiatric readmission would facilitate the development of interventions to reduce this risk, a major driver of psychiatric health-care costs. The symptoms or characteristics of illness course necessary to develop reliable predictors are not available in coded billing data, but may be present in narrative electronic health record (EHR) discharge summaries. We identified a cohort of individuals admitted to a psychiatric inpatient unit between 1994 and 2012 with a principal diagnosis of major depressive disorder, and extracted inpatient psychiatric discharge narrative notes. Using these data, we trained a 75-topic Latent Dirichlet Allocation (LDA) model, a form of natural language processing, which identifies groups of words associated with topics discussed in a document collection. The cohort was randomly split to derive a training (70%) and testing (30%) data set, and we trained separate support vector machine models for baseline clinical features alone, baseline features plus common individual words and the above plus topics identified from the 75-topic LDA model. Of 4687 patients with inpatient discharge summaries, 470 were readmitted within 30 days. The 75-topic LDA model included topics linked to psychiatric symptoms (suicide, severe depression, anxiety, trauma, eating/weight and panic) and major depressive disorder comorbidities (infection, postpartum, brain tumor, diarrhea and pulmonary disease). By including LDA topics, prediction of readmission, as measured by area under receiver-operating characteristic curves in the testing data set, was improved from baseline (area under the curve 0.618) to baseline + 1000 words (0.682) to baseline+75 topics (0.784). Inclusion of topics derived from narrative notes allows more accurate discrimination of individuals at high risk for psychiatric readmission in this cohort. Topic modeling and related approaches offer the potential to improve prediction using EHRs, if generalizability can be established in other clinical cohorts.
引用
收藏
页码:e921 / e921
页数:5
相关论文
共 10 条
  • [1] Natural language processing applied to mental illness detection: a narrative review
    Zhang, Tianlin
    Schoene, Annika M.
    Ji, Shaoxiong
    Ananiadou, Sophia
    NPJ DIGITAL MEDICINE, 2022, 5 (01)
  • [2] Predicting neuroticism with open-ended response using natural language processing
    Yoon, Seowon
    Jang, Jihee
    Son, Gaeun
    Park, Soohyun
    Hwang, Jueun
    Choeh, Joon Yeon
    Choi, Kee-Hong
    FRONTIERS IN PSYCHIATRY, 2024, 15
  • [3] Understanding psychiatric illness through natural language processing (UNDERPIN): Rationale, design, and methodology
    Kishimoto, Taishiro
    Nakamura, Hironobu
    Kano, Yoshinobu
    Eguchi, Yoko
    Kitazawa, Momoko
    Liang, Kuo-ching
    Kudo, Koki
    Sento, Ayako
    Takamiya, Akihiro
    Horigome, Toshiro
    Yamasaki, Toshihiko
    Sunami, Yuki
    Kikuchi, Toshiaki
    Nakajima, Kazuki
    Tomita, Masayuki
    Bun, Shogyoku
    Momota, Yuki
    Sawada, Kyosuke
    Murakami, Junichi
    Takahashi, Hidehiko
    Mimura, Masaru
    FRONTIERS IN PSYCHIATRY, 2022, 13
  • [4] Identifying Suicide Ideation and Suicidal Attempts in a Psychiatric Clinical Research Database using Natural Language Processing
    Fernandes, Andrea C.
    Dutta, Rina
    Velupillai, Sumithra
    Sanyal, Jyoti
    Stewart, Robert
    Chandran, David
    SCIENTIFIC REPORTS, 2018, 8
  • [5] Construction of an Assisted Model Based on Natural Language Processing for Automatic Early Diagnosis of Autoimmune Encephalitis
    Zhao, Yunsong
    Ren, Bin
    Yu, Wenjin
    Zhang, Haijun
    Zhao, Di
    Lv, Junchao
    Xie, Zhen
    Jiang, Kun
    Shang, Lei
    Yao, Han
    Xu, Yongyong
    Zhao, Gang
    NEUROLOGY AND THERAPY, 2022, 11 (03) : 1117 - 1134
  • [6] Construction of an Assisted Model Based on Natural Language Processing for Automatic Early Diagnosis of Autoimmune Encephalitis
    Yunsong Zhao
    Bin Ren
    Wenjin Yu
    Haijun Zhang
    Di Zhao
    Junchao Lv
    Zhen Xie
    Kun Jiang
    Lei Shang
    Han Yao
    Yongyong Xu
    Gang Zhao
    Neurology and Therapy, 2022, 11 : 1117 - 1134
  • [7] Natural Language Processing Analysis of Psychiatric Progress Notes: An Innovative Approach to Real-Time Unbiased Construct Quantification
    Vogelgsang, Jonathan
    Holstein, Vincent
    Dan, Shu
    Abston, Joshua
    Mccoy, Tom
    Klengel, Torsten
    Ressler, Kerry
    Berretta, Sabina
    BIOLOGICAL PSYCHIATRY, 2024, 95 (10) : S168 - S168
  • [8] Predicting which patients with cancer will see a psychiatrist or counsellor from their initial oncology consultation document using natural language processing
    Nunez, John-Jose
    Leung, Bonnie
    Ho, Cheryl
    Ng, Raymond T.
    Bates, Alan T.
    COMMUNICATIONS MEDICINE, 2024, 4 (01):
  • [9] Novel Use of Natural Language Processing (NLP) to Predict Suicidal Ideation and Psychiatric Symptoms in a Text-Based Mental Health Intervention in Madrid
    Cook, Benjamin L.
    Progovac, Ana M.
    Chen, Pei
    Mullin, Brian
    Hou, Sherry
    Baca-Garcia, Enrique
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2016, 2016
  • [10] A Comparison of Connected Speech Tasks for Detecting Early Alzheimer's Disease and Mild Cognitive Impairment Using Natural Language Processing and Machine Learning
    Clarke, Natasha
    Barrick, Thomas R.
    Garrard, Peter
    FRONTIERS IN COMPUTER SCIENCE, 2021, 3