Current applications and challenges in large language models for patient care: a systematic review

被引:12
作者
Busch, Felix [1 ]
Hoffmann, Lena [2 ,3 ,4 ]
Rueger, Christopher [2 ,3 ,4 ]
van Dijk, Elon H. C.
Kader, Rawen [5 ]
Ortiz-Prado, Esteban [6 ]
Makowski, Marcus R. [1 ]
Saba, Luca [7 ]
Hadamitzky, Martin [8 ]
Kather, Jakob Nikolas [9 ,10 ]
Truhn, Daniel [11 ]
Cuocolo, Renato [12 ]
Adams, Lisa C. [1 ]
Bressem, Keno K. [1 ,8 ]
机构
[1] Tech Univ Munich, TUM Univ Hosp, Sch Med & Hlth, Dept Diagnost & Intervent Radiol,Klinikum Rechts I, Munich, Germany
[2] Charite Univ Med Berlin, Dept Neuroradiol, Hindenburgdamm 30, D-12203 Berlin, Germany
[3] Free Univ Berlin, Berlin, Germany
[4] Humboldt Univ, Berlin, Germany
[5] UCL, Div Surg & Intervent Sci, London, England
[6] Univ Americas, Fac Hlth Sci, Hlth Res Grp 1, Quito, Ecuador
[7] Azienda Osped Univ AOU, Dept Radiol, I-09045 Cagliari, Italy
[8] Tech Univ Munich, TUM Univ Hosp, Sch Med & Hlth, German Heart Ctr Munich,Inst Cardiovasc Radiol & N, Munich, Germany
[9] Heidelberg Univ Hosp, Natl Ctr Tumor Dis NCT, Dept Med Oncol, Heidelberg, Germany
[10] Tech Univ Dresden, Med Fac Carl Gustav Carus, Else Kroener Fresenius Ctr Digital Hlth, Dresden, Germany
[11] Univ Hosp Aachen, Dept Diagnost & Intervent Radiol, Aachen, Germany
[12] Univ Salerno, Dept Med Surg & Dent, Baronissi, Italy
来源
COMMUNICATIONS MEDICINE | 2025年 / 5卷 / 01期
关键词
CHATGPT; QUESTIONS; CANCER; PERFORMANCE; MEDICINE; BARD;
D O I
10.1038/s43856-024-00717-2
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
BackgroundThe introduction of large language models (LLMs) into clinical practice promises to improve patient education and empowerment, thereby personalizing medical care and broadening access to medical knowledge. Despite the popularity of LLMs, there is a significant gap in systematized information on their use in patient care. Therefore, this systematic review aims to synthesize current applications and limitations of LLMs in patient care.MethodsWe systematically searched 5 databases for qualitative, quantitative, and mixed methods articles on LLMs in patient care published between 2022 and 2023. From 4349 initial records, 89 studies across 29 medical specialties were included. Quality assessment was performed using the Mixed Methods Appraisal Tool 2018. A data-driven convergent synthesis approach was applied for thematic syntheses of LLM applications and limitations using free line-by-line coding in Dedoose.ResultsWe show that most studies investigate Generative Pre-trained Transformers (GPT)-3.5 (53.2%, n = 66 of 124 different LLMs examined) and GPT-4 (26.6%, n = 33/124) in answering medical questions, followed by patient information generation, including medical text summarization or translation, and clinical documentation. Our analysis delineates two primary domains of LLM limitations: design and output. Design limitations include 6 second-order and 12 third-order codes, such as lack of medical domain optimization, data transparency, and accessibility issues, while output limitations include 9 second-order and 32 third-order codes, for example, non-reproducibility, non-comprehensiveness, incorrectness, unsafety, and bias.ConclusionsThis review systematically maps LLM applications and limitations in patient care, providing a foundational framework and taxonomy for their implementation and evaluation in healthcare settings.
引用
收藏
页数:13
相关论文
共 151 条
[1]   Large language models show human- like content biases in transmission chain experiments [J].
Acerbi, Alberto ;
Stubbersfield, Joseph M. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (44)
[2]   Leveraging GPT-4 for Post Hoc Transformation of Free-text Radiology Reports into Structured Reporting: A Multilingual Feasibility Study [J].
Adams, Lisa C. ;
Truhn, Daniel ;
Busch, Felix ;
Kader, Avan ;
Niehues, Stefan M. ;
Makowski, Marcus R. ;
Bressem, Keno K. .
RADIOLOGY, 2023, 307 (04)
[3]   Online patient education in body contouring: A comparison between Google and ChatGPT [J].
Alessandri-Bonetti, Mario ;
Liu, Hilary Y. ;
Palmesano, Marco ;
Nguyen, Vu T. ;
Egro, Francesco M. .
JOURNAL OF PLASTIC RECONSTRUCTIVE AND AESTHETIC SURGERY, 2023, 87 :390-402
[4]  
Ali H., 2023, iGIE, V2, P553, DOI 10.1016/j.igie.2023.10.001
[5]   Explainability for artificial intelligence in healthcare: a multidisciplinary perspective [J].
Amann, Julia ;
Blasimme, Alessandro ;
Vayena, Effy ;
Frey, Dietmar ;
Madai, Vince I. .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (01)
[6]  
[Anonymous], 2024, Dedoose Version 9.2.4, cloud application for managing, analyzing, and presenting qualitative and mixed method research data
[7]  
Athavale Anand, 2023, JVS Vasc Insights, V1, DOI 10.1016/j.jvsvi.2023.100019
[8]  
Atil B, 2024, Arxiv, DOI arXiv:2408.04667
[9]   Head-to-Head Comparison of ChatGPT Versus Google Search for Medical Knowledge Acquisition [J].
Ayoub, Noel F. ;
Lee, Yu-Jin ;
Grimm, David ;
Divi, Vasu .
OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2024, 170 (06) :1484-1491
[10]   Potential Use of ChatGPT for Patient Information in Periodontology: A Descriptive Pilot Study [J].
Babayigit, Osman ;
Eroglu, Zeynep Tastan ;
Sen, Dilek Ozkan ;
Yarkac, Fatma Ucan .
CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (11)