Identifying Patients With Delirium Based on Unstructured Clinical Notes: Observational Study

被引:8
|
作者
Ge, Wendong [1 ]
Alabsi, Haitham [1 ]
Jain, Aayushee [1 ]
Ye, Elissa [1 ]
Sun, Haoqi [1 ]
Fernandes, Marta [1 ]
Magdamo, Colin [1 ]
Tesh, Ryan A. [1 ]
Collens, Sarah, I [1 ]
Newhouse, Amy [1 ]
Moura, Lidia M. V. R. [1 ]
Zafar, Sahar [1 ]
Hsu, John [1 ]
Akeju, Oluwaseun [1 ]
Robbins, Gregory K. [1 ]
Mukerji, Shibani S. [1 ]
Das, Sudeshna [1 ]
Westover, M. Brandon [1 ]
机构
[1] Massachusetts Gen Hosp, 50 Staniford St, Boston, MA 02114 USA
基金
美国国家卫生研究院;
关键词
delirium; electronic health records; clinical notes; machine learning; natural language processing;
D O I
10.2196/33834
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Delirium in hospitalized patients is a syndrome of acute brain dysfunction. Diagnostic (International Classification of Diseases [ICD]) codes are often used in studies using electronic health records (EHRs), but they are inaccurate. Objective: We sought to develop a more accurate method using natural language processing (NLP) to detect delirium episodes on the basis of unstructured clinical notes. Methods: We collected 1.5 million notes from >10,000 patients from among 9 hospitals. Seven experts iteratively labeled 200,471 sentences. Using these, we trained three NLP classifiers: Support Vector Machine, Recurrent Neural Networks, and Transformer. Testing was performed using an external data set. We also evaluated associations with delirium billing (ICD) codes, medications, orders for restraints and sitters, direct assessments (Confusion Assessment Method [CAM] scores), and in-hospital mortality. F1 scores, confusion matrices, and areas under the receiver operating characteristic curve (AUCs) were used to compare NLP models. We used the phi coefficient to measure associations with other delirium indicators. Results: The transformer NLP performed best on the following parameters: micro F1=0.978, macro F1=0.918, positive AUC=0.984, and negative AUC=0.992. NLP detections exhibited higher correlations (phi) than ICD codes with deliriogenic medications (0.194 vs 0.073 for ICD codes), restraints and sitter orders (0.358 vs 0.177), mortality (0.216 vs 0.000), and CAM scores (0.256 vs -0.028). Conclusions: Clinical notes are an attractive alternative to ICD codes for EHR delirium studies but require automated methods. Our NLP model detects delirium with high accuracy, similar to manual chart review. Our NLP approach can provide more accurate determination of delirium for large-scale EHR-based studies regarding delirium, quality improvement, and clinical trails.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Publicly available machine learning models for identifying opioid misuse from the clinical notes of hospitalized patients
    Sharma, Brihat
    Dligach, Dmitriy
    Swope, Kristin
    Salisbury-Afshar, Elizabeth
    Karnik, Niranjan S.
    Joyce, Cara
    Afshar, Majid
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (01)
  • [22] Incidence, characteristics, and outcomes of delirium in patients with noninvasive ventilation: a prospective observational study
    Zhang, Rui
    Bai, Linfu
    Han, Xiaoli
    Huang, Shicong
    Zhou, Lintong
    Duan, Jun
    BMC PULMONARY MEDICINE, 2021, 21 (01)
  • [23] Prevalence of Delirium in End-of-Life Palliative Care Patients: An Observational Study
    Recchia, Angela
    Rizzi, Barbara
    Favero, Alessandra
    Nobili, Alessandro
    Pasina, Luca
    MEDICAL PRINCIPLES AND PRACTICE, 2022, 31 (02) : 118 - 124
  • [24] Delirium motor subtypes and prognosis in hospitalized geriatric patients - A prospective observational study
    Evensen, Sigurd
    Saltvedt, Ingvild
    Lydersen, Stian
    Wyller, Torgeir Bruun
    Taraldsen, Kristin
    Sletvold, Olav
    JOURNAL OF PSYCHOSOMATIC RESEARCH, 2019, 122 : 24 - 28
  • [25] TAGS: Towards Automated Classification of Unstructured Clinical Nursing Notes
    Gangavarapu, Tushaar
    Jayasimha, Aditya
    Krishnan, Gokul S.
    Kamath, Sowmya S.
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2019), 2019, 11608 : 195 - 207
  • [26] Identifying delirium in Parkinson disease: A pilot study
    Lawson, Rachael A.
    Richardson, Sarah J.
    Yarnall, Alison J.
    Burn, David J.
    Allan, Louise M.
    INTERNATIONAL JOURNAL OF GERIATRIC PSYCHIATRY, 2020, 35 (05) : 547 - 552
  • [27] Identification of delirium from real-world electronic health record clinical notes
    St Sauver, Jennifer
    Fu, Sunyang
    Sohn, Sunghwan
    Weston, Susan
    Fan, Chun
    Olson, Janet
    Thorsteinsdottir, Bjoerg
    Lebrasseur, Nathan
    Pagali, Sandeep
    Rocca, Walter
    Liu, Hongfang
    JOURNAL OF CLINICAL AND TRANSLATIONAL SCIENCE, 2023, 7 (01)
  • [28] Identifying Symptom Information in Clinical Notes Using Natural Language Processing
    Koleck, Theresa A.
    Tatonetti, Nicholas P.
    Bakken, Suzanne
    Mitha, Shazia
    Henderson, Morgan M.
    George, Maureen
    Miaskowski, Christine
    Smaldone, Arlene
    Topaz, Maxim
    NURSING RESEARCH, 2021, 70 (03) : 173 - 183
  • [29] Identifying Mentions of Life Stressors in Clinical Notes
    Datar, Shreya
    Lindemann, Elizabeth A.
    Silverman, Greg
    McEwan, Reed
    Finzel, Raymond
    Kotlyar, Michael
    Melton, Genevieve B.
    Pakhomov, Serguei V. S.
    2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021), 2021, : 153 - 160
  • [30] External Validation of a Machine Learning Based Delirium Prediction Software in Clinical Routine
    Jauk, Stefanie
    Veeranki, Sai Pavan Kumar
    Kramer, Diether
    Hoegler, Stefan
    Muehleckerb, David
    Eberhartlb, Erwin
    Schuelerb, Arthur
    Chvostac, Christian
    Strasserc, Wolfgang
    Strasserb, Reinhold
    Leodoltera, Werner
    DHEALTH 2022-PROCEEDINGS OF THE 16TH HEALTH INFORMATICS MEETS DIGITAL HEALTH CONFERENCE, 2022, 293 : 93 - 100