Identifying signs and symptoms of urinary tract infection from emergency department clinical notes using large language models

被引:4
作者
Iscoe, Mark [1 ,2 ]
Socrates, Vimig [2 ,3 ]
Gilson, Aidan [4 ]
Chi, Ling [5 ]
Li, Huan [3 ]
Huang, Thomas [4 ]
Kearns, Thomas [1 ]
Perkins, Rachelle [1 ]
Khandjian, Laura [1 ]
Taylor, R. Andrew [1 ,2 ]
机构
[1] Yale Sch Med, Dept Emergency Med, New Haven, CT 06519 USA
[2] Yale Univ, Sch Med, Sect Biomed Informat & Data Sci, New Haven, CT USA
[3] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT USA
[4] Yale Sch Med, New Haven, CT 06519 USA
[5] Yale Sch Publ Hlth, Dept Biostat, New Haven, CT USA
关键词
emergency medicine; infectious diseases; informatics; large language models; named entity recognition; natural language processing; urinary tract infection; INFORMATION; AGREEMENT; CARE; EXTRACTION; MANAGEMENT; ACCURACY; CRITERIA;
D O I
10.1111/acem.14883
中图分类号
R4 [临床医学];
学科分类号
1002 ; 100602 ;
摘要
BackgroundNatural language processing (NLP) tools including recently developed large language models (LLMs) have myriad potential applications in medical care and research, including the efficient labeling and classification of unstructured text such as electronic health record (EHR) notes. This opens the door to large-scale projects that rely on variables that are not typically recorded in a structured form, such as patient signs and symptoms.ObjectivesThis study is designed to acquaint the emergency medicine research community with the foundational elements of NLP, highlighting essential terminology, annotation methodologies, and the intricacies involved in training and evaluating NLP models. Symptom characterization is critical to urinary tract infection (UTI) diagnosis, but identification of symptoms from the EHR has historically been challenging, limiting large-scale research, public health surveillance, and EHR-based clinical decision support. We therefore developed and compared two NLP models to identify UTI symptoms from unstructured emergency department (ED) notes.MethodsThe study population consisted of patients aged >= 18 who presented to an ED in a northeastern U.S. health system between June 2013 and August 2021 and had a urinalysis performed. We annotated a random subset of 1250 ED clinician notes from these visits for a list of 17 UTI symptoms. We then developed two task-specific LLMs to perform the task of named entity recognition: a convolutional neural network-based model (SpaCy) and a transformer-based model designed to process longer documents (Clinical Longformer). Models were trained on 1000 notes and tested on a holdout set of 250 notes. We compared model performance (precision, recall, F1 measure) at identifying the presence or absence of UTI symptoms at the note level.ResultsA total of 8135 entities were identified in 1250 notes; 83.6% of notes included at least one entity. Overall F1 measure for note-level symptom identification weighted by entity frequency was 0.84 for the SpaCy model and 0.88 for the Longformer model. F1 measure for identifying presence or absence of any UTI symptom in a clinical note was 0.96 (232/250 correctly classified) for the SpaCy model and 0.98 (240/250 correctly classified) for the Longformer model.ConclusionsThe study demonstrated the utility of LLMs and transformer-based models in particular for extracting UTI symptoms from unstructured ED clinical notes; models were highly accurate for detecting the presence or absence of any UTI symptom on the note level, with variable performance for individual symptoms.
引用
收藏
页码:599 / 610
页数:12
相关论文
共 27 条
  • [21] Predicting seizure recurrence after an initial seizure-like episode from routine clinical notes using large language models: a retrospective cohort study
    Beaulieu-Jones, Brett K.
    Villamar, Mauricio F.
    Scordis, Phil
    Bartmann, Ana Paula
    Ali, Waqar
    Wissel, Benjamin
    Alsentzer, Emily
    de Jong, Johann
    Patra, Arijit
    Kohane, Isaac
    LANCET DIGITAL HEALTH, 2023, 5 (12): : E882 - E894
  • [22] Early identification of suspected serious infection among patients afebrile at initial presentation using neural network models and natural language processing: A development and external validation study in the emergency department
    Choi, Dong Hyun
    Choi, Sae Won
    Kim, Ki Hong
    Choi, Yeongho
    Kim, Yoonjic
    AMERICAN JOURNAL OF EMERGENCY MEDICINE, 2024, 80 : 67 - 76
  • [23] Discrediting microscopic pyuria and leucocyte esterase as diagnostic surrogates for infection in patients with lower urinary tract symptoms: results from a clinical and laboratory evaluation
    Kupelian, Anthony S.
    Horsley, Harry
    Khasriya, Rajvinder
    Amussah, Rasheedah T.
    Badiani, Raj
    Courtney, Angela M.
    Chandhyoke, Nihil S.
    Riaz, Usama
    Savlani, Karishma
    Moledina, Malik
    Montes, Samantha
    O'Connor, Dominic
    Visavadia, Rakhee
    Kelsey, Michael
    Rohn, Jennifer L.
    Malone-Lee, James
    BJU INTERNATIONAL, 2013, 112 (02) : 231 - 238
  • [24] Constructing synthetic datasets with generative artificial intelligence to train large language models to classify acute renal failure from clinical notes
    Litake, Onkar
    Park, Brian H.
    Tully, Jeffrey L.
    Gabriel, Rodney A.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (06) : 1404 - 1410
  • [25] Predictive factors for identifying infection source using combined chest-abdominal computed tomography in acute febrile older patients exhibiting no clinical indications in the emergency department
    Sung, Won Young
    Kim, Jin Cheol
    Seo, Sang Won
    Lee, Keun Taek
    Yang, Heebum
    SIGNA VITAE, 2024, 20 (07) : 86 - 95
  • [26] Building large-scale registries from unstructured clinical notes using a low-resource natural language processing pipeline
    Tavabi, Nazgol
    Pruneski, James
    Golchin, Shahriar
    Singh, Mallika
    Sanborn, Ryan
    Heyworth, Benton
    Landschaft, Assaf
    Kimia, Amir
    Kiapour, Ata
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 151
  • [27] Clustering of clinical symptoms using large language models reveals low diagnostic specificity of proposed alternatives to consensus mast cell activation syndrome criteria
    Solomon, Benjamin D.
    Khatri, Purvesh
    JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2025, 155 (01) : 213 - 218.e4