Disambiguation of acronyms in clinical narratives with large language models

被引:3
作者
Kugic, Amila [1 ]
Schulz, Stefan [1 ]
Kreuzthaler, Markus [1 ]
机构
[1] Med Univ Graz, Inst Med Informat Stat & Documentat, Auenbruggerpl 2-5, A-8036 Graz, Austria
关键词
natural language processing; large language models; electronic health records; acronyms; HEALTH;
D O I
10.1093/jamia/ocae157
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective To assess the performance of large language models (LLMs) for zero-shot disambiguation of acronyms in clinical narratives.Materials and Methods Clinical narratives in English, German, and Portuguese were applied for testing the performance of four LLMs: GPT-3.5, GPT-4, Llama-2-7b-chat, and Llama-2-70b-chat. For English, the anonymized Clinical Abbreviation Sense Inventory (CASI, University of Minnesota) was used. For German and Portuguese, at least 500 text spans were processed. The output of LLM models, prompted with contextual information, was analyzed to compare their acronym disambiguation capability, grouped by document-level metadata, the source language, and the LLM.Results On CASI, GPT-3.5 achieved 0.91 in accuracy. GPT-4 outperformed GPT-3.5 across all datasets, reaching 0.98 in accuracy for CASI, 0.86 and 0.65 for two German datasets, and 0.88 for Portuguese. Llama models only reached 0.73 for CASI and failed severely for German and Portuguese. Across LLMs, performance decreased from English to German and Portuguese processing languages. There was no evidence that additional document-level metadata had a significant effect.Conclusion For English clinical narratives, acronym resolution by GPT-4 can be recommended to improve readability of clinical text by patients and professionals. For German and Portuguese, better models are needed. Llama models, which are particularly interesting for processing sensitive content on premise, cannot yet be recommended for acronym resolution.
引用
收藏
页码:2040 / 2046
页数:7
相关论文
共 23 条
  • [1] Abacha A.B., 2023, P ANN M ASS COMP LIN, P503
  • [2] Adams G, 2020, PR MACH LEARN RES, V136, P12
  • [3] Agrawal M., 2022, P 2022 C EMPIRICAL M, P1998, DOI [10.18653/v1/2022.emnlp-main.130, DOI 10.18653/V1/2022.EMNLP-MAIN.130]
  • [4] ChatGPT's Ability to Assist with Clinical Documentation: A Randomized Controlled Trial
    Baker, Hayden P.
    Dwyer, Emma
    Kalidoss, Senthooran
    Hynes, Kelly
    Wolf, Jennifer
    Strelzow, Jason A.
    [J]. JOURNAL OF THE AMERICAN ACADEMY OF ORTHOPAEDIC SURGEONS, 2024, 32 (03) : 123 - 129
  • [5] ChatGPT in medicine: an overview of its applications, advantages, limitations, future prospects, and ethical considerations
    Dave, Tirth
    Athaluri, Sai Anirudh
    Singh, Satyam
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 6
  • [6] Dreano Soren, 2023, P 8 C MACH TRANSL, P738
  • [7] Million Veteran Program: A mega-biobank to study genetic influences on health and disease
    Gaziano, John Michael
    Concato, John
    Brophy, Mary
    Fiore, Louis
    Pyarajan, Saiju
    Breeling, James
    Whitbourne, Stacey
    Deen, Jennifer
    Shannon, Colleen
    Humphries, Donald
    Guarino, Peter
    Aslan, Mihaela
    Anderson, Daniel
    LaFleur, Rene
    Hammond, Timothy
    Schaa, Kendra
    Moser, Jennifer
    Huang, Grant
    Muralidhar, Sumitra
    Przygodzki, Ronald
    O'Leary, Timothy J.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2016, 70 : 214 - 223
  • [8] The CLASSE GATOR (CLinical Acronym SenSE disambiGuATOR): A Method for predicting acronym sense from neonatal clinical notes
    Kashyap, Aditya
    Burris, Heather
    Callison-Burch, Chris
    Boland, Mary Regina
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2020, 137
  • [9] Kugic Amila, 2023, Stud Health Technol Inform, V309, P78, DOI 10.3233/SHTI230743
  • [10] Binary acronym disambiguation in clinical notes from electronic health records with an application in computational phenotyping
    Link, Nicholas B.
    Huang, Sicong
    Cai, Tianrun
    Sun, Jiehuan
    Dahal, Kumar
    Costa, Lauren
    Cho, Kelly
    Liao, Katherine
    Cai, Tianxi
    Hong, Chuan
    Collaboration Million Vet Program, Million Veteran Program
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2022, 162