Improving large language model applications in biomedicine with retrieval-augmented generation: a systematic review, meta-analysis, and clinical development guidelines

Cited by: 2

Authors:

Liu, Siru [1,2]

McCoy, Allison B. [1]

Wright, Adam [1,3]

Affiliations:

[1] Vanderbilt Univ, Med Ctr, Dept Biomed Informat, 2525 West End Ave 1475, Nashville, TN 37212 USA

[2] Vanderbilt Univ, Dept Comp Sci, Nashville, TN 37212 USA

[3] Vanderbilt Univ, Med Ctr, Dept Med, Nashville, TN 37212 USA

Source:

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION | 2025 / Vol. 32 / Issue 04

Funding:

US National Institutes of Health (NIH);

Keywords:

large language model; retrieval augmented generation; systematic review; meta-analysis; BIAS;

DOI:

10.1093/jamia/ocaf008

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812;

Abstract:

Objective: The objectives of this study are to synthesize findings from recent research of retrieval-augmented generation (RAG) and large language models (LLMs) in biomedicine and provide clinical development guidelines to improve effectiveness.

Materials and Methods: We conducted a systematic literature review and a meta-analysis. The report was created in adherence to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses 2020 analysis. Searches were performed in 3 databases (PubMed, Embase, PsycINFO) using terms related to "retrieval augmented generation" and "large language model," for articles published in 2023 and 2024. We selected studies that compared baseline LLM performance with RAG performance. We developed a random-effect meta-analysis model, using odds ratio as the effect size.

Results: Among 335 studies, 20 were included in this literature review. The pooled effect size was 1.35, with a 95% confidence interval of 1.19-1.53, indicating a statistically significant effect (P = .001). We reported clinical tasks, baseline LLMs, retrieval sources and strategies, as well as evaluation methods.

Discussion: Building on our literature review, we developed Guidelines for Unified Implementation and Development of Enhanced LLM Applications with RAG in Clinical Settings to inform clinical applications using RAG.

Conclusion: Overall, RAG implementation showed a 1.35 odds ratio increase in performance compared to baseline LLMs. Future research should focus on (1) system-level enhancement: the combination of RAG and agent, (2) knowledge-level enhancement: deep integration of knowledge into LLM, and (3) integration-level enhancement: integrating RAG systems within electronic health records.
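Illustrative note: the abstract reports a random-effects meta-analysis with the odds ratio as the effect size, but does not say which estimator or software was used. The sketch below shows one common way such a pooled odds ratio and 95% confidence interval can be computed, assuming the DerSimonian-Laird estimator and a 0.5 continuity correction for zero cells. The function name `pooled_odds_ratio`, the 2x2 table layout, and the example counts are hypothetical and are not taken from the paper.

```python
import math


def pooled_odds_ratio(tables):
    """Pool 2x2 tables with a DerSimonian-Laird random-effects model (assumed estimator).

    Each table is (a, b, c, d), read here as:
    a = RAG correct, b = RAG incorrect, c = baseline correct, d = baseline incorrect.
    Returns (pooled OR, 95% CI lower, 95% CI upper, tau^2).
    """
    log_ors, variances = [], []
    for a, b, c, d in tables:
        # Continuity correction so the log odds ratio is defined when a cell is zero.
        if 0 in (a, b, c, d):
            a, b, c, d = a + 0.5, b + 0.5, c + 0.5, d + 0.5
        log_ors.append(math.log((a * d) / (b * c)))
        variances.append(1 / a + 1 / b + 1 / c + 1 / d)

    # Fixed-effect (inverse-variance) weights, used to compute Cochran's Q.
    w = [1 / v for v in variances]
    sum_w = sum(w)
    fixed_mean = sum(wi * yi for wi, yi in zip(w, log_ors)) / sum_w
    q = sum(wi * (yi - fixed_mean) ** 2 for wi, yi in zip(w, log_ors))

    # Between-study variance tau^2, truncated at zero.
    df = len(tables) - 1
    scale = sum_w - sum(wi ** 2 for wi in w) / sum_w
    tau2 = max(0.0, (q - df) / scale) if scale > 0 else 0.0

    # Random-effects weights add tau^2 to each study's variance.
    w_re = [1 / (v + tau2) for v in variances]
    mean = sum(wi * yi for wi, yi in zip(w_re, log_ors)) / sum(w_re)
    se = math.sqrt(1 / sum(w_re))

    return (
        math.exp(mean),
        math.exp(mean - 1.96 * se),
        math.exp(mean + 1.96 * se),
        tau2,
    )


# Hypothetical counts for three studies; not data from the review.
example = [(60, 40, 50, 50), (45, 15, 40, 20), (80, 20, 70, 30)]
odds_ratio, ci_low, ci_high, tau2 = pooled_odds_ratio(example)
print(f"pooled OR = {odds_ratio:.2f}, 95% CI {ci_low:.2f}-{ci_high:.2f}, tau^2 = {tau2:.3f}")
```

A pooled odds ratio above 1 with a confidence interval that excludes 1 is the kind of result the abstract describes as statistically significant.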

Pages: 605-615

Number of pages: 11
