Identifying Patients With Delirium Based on Unstructured Clinical Notes: Observational Study

被引:8
作者
Ge, Wendong [1 ]
Alabsi, Haitham [1 ]
Jain, Aayushee [1 ]
Ye, Elissa [1 ]
Sun, Haoqi [1 ]
Fernandes, Marta [1 ]
Magdamo, Colin [1 ]
Tesh, Ryan A. [1 ]
Collens, Sarah, I [1 ]
Newhouse, Amy [1 ]
Moura, Lidia M. V. R. [1 ]
Zafar, Sahar [1 ]
Hsu, John [1 ]
Akeju, Oluwaseun [1 ]
Robbins, Gregory K. [1 ]
Mukerji, Shibani S. [1 ]
Das, Sudeshna [1 ]
Westover, M. Brandon [1 ]
机构
[1] Massachusetts Gen Hosp, 50 Staniford St, Boston, MA 02114 USA
基金
美国国家卫生研究院;
关键词
delirium; electronic health records; clinical notes; machine learning; natural language processing;
D O I
10.2196/33834
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Delirium in hospitalized patients is a syndrome of acute brain dysfunction. Diagnostic (International Classification of Diseases [ICD]) codes are often used in studies using electronic health records (EHRs), but they are inaccurate. Objective: We sought to develop a more accurate method using natural language processing (NLP) to detect delirium episodes on the basis of unstructured clinical notes. Methods: We collected 1.5 million notes from >10,000 patients from among 9 hospitals. Seven experts iteratively labeled 200,471 sentences. Using these, we trained three NLP classifiers: Support Vector Machine, Recurrent Neural Networks, and Transformer. Testing was performed using an external data set. We also evaluated associations with delirium billing (ICD) codes, medications, orders for restraints and sitters, direct assessments (Confusion Assessment Method [CAM] scores), and in-hospital mortality. F1 scores, confusion matrices, and areas under the receiver operating characteristic curve (AUCs) were used to compare NLP models. We used the phi coefficient to measure associations with other delirium indicators. Results: The transformer NLP performed best on the following parameters: micro F1=0.978, macro F1=0.918, positive AUC=0.984, and negative AUC=0.992. NLP detections exhibited higher correlations (phi) than ICD codes with deliriogenic medications (0.194 vs 0.073 for ICD codes), restraints and sitter orders (0.358 vs 0.177), mortality (0.216 vs 0.000), and CAM scores (0.256 vs -0.028). Conclusions: Clinical notes are an attractive alternative to ICD codes for EHR delirium studies but require automated methods. Our NLP model detects delirium with high accuracy, similar to manual chart review. Our NLP approach can provide more accurate determination of delirium for large-scale EHR-based studies regarding delirium, quality improvement, and clinical trails.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Depression and anxiety increase the odds of developing delirium in ICU patients; a prospective observational study
    Arbabi, Mohammad
    Dezhdar, Zhaleh
    Amini, Behnam
    Dehnavi, Ali Zare
    Ghasemi, Moein
    COGNITIVE NEUROPSYCHIATRY, 2022, 27 (01) : 1 - 10
  • [32] The Association of a Frailty Index and Incident Delirium in Older Hospitalized Patients: An Observational Cohort Study
    Sillner, Andrea Yevchak
    McConeghy, Robert Owens
    Madrigal, Caroline
    Culley, Deborah J.
    Arora, Rakesh C.
    Rudolph, James L.
    CLINICAL INTERVENTIONS IN AGING, 2020, 15 : 2053 - 2061
  • [33] Building large-scale registries from unstructured clinical notes using a low-resource natural language processing pipeline
    Tavabi, Nazgol
    Pruneski, James
    Golchin, Shahriar
    Singh, Mallika
    Sanborn, Ryan
    Heyworth, Benton
    Landschaft, Assaf
    Kimia, Amir
    Kiapour, Ata
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 151
  • [34] Delirium in COVID-19 and post-liver transplant patients: an observational study
    Fiore, Gianluca
    Ferrari, Silvia
    Cutino, Anna
    Giorgino, Claudia
    Valeo, Laura
    Galeazzi, Gian M.
    Marchi, Mattia
    INTERNATIONAL JOURNAL OF PSYCHIATRY IN CLINICAL PRACTICE, 2022, 26 (04) : 343 - 351
  • [35] Towards unstructured mortality prediction with free-text clinical notes
    Hashir, Mohammad
    Sawhney , Rapinder
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 108 (108)
  • [36] Identifying vulnerable older adult populations by contextualizing geriatric syndrome information in clinical notes of electronic health records
    Chen, Tao
    Dredze, Mark
    Weiner, Jonathan R.
    Kharrazi, Hadi
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2019, 26 (8-9) : 787 - 795
  • [37] A prospective observational study to investigate utility of the Delirium Observational Screening Scale (DOSS) to detect delirium in care home residents
    Teale, E. A.
    Munyombwe, T.
    Schuurmans, M.
    Siddiqi, N.
    Young, J.
    AGE AND AGEING, 2018, 47 (01) : 56 - 61
  • [38] Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach
    Obeid, Jihad S.
    Dahne, Jennifer
    Christensen, Sean
    Howard, Samuel
    Crawford, Tami
    Frey, Lewis J.
    Stecker, Tracy
    Bunnell, Brian E.
    JMIR MEDICAL INFORMATICS, 2020, 8 (07)
  • [39] Automatic trial eligibility surveillance based on unstructured clinical data
    Meystre, Stephane M.
    Heider, Paul M.
    Kim, Youngjun
    Aruch, Daniel B.
    Britten, Carolyn D.
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2019, 129 : 13 - 19
  • [40] Predicting ICD-9 code groups with fuzzy similarity based supervised multi-label classification of unstructured clinical nursing notes
    Gangavarapu, Tushaar
    Jayasimha, Aditya
    Krishnan, Gokul S.
    Kamath, Sowmya S.
    KNOWLEDGE-BASED SYSTEMS, 2020, 190