Improving large language model applications in biomedicine with retrieval-augmented generation: a systematic review, meta-analysis, and clinical development guidelines

被引:2
|
作者
Liu, Siru [1 ,2 ]
Mccoy, Allison B. [1 ]
Wright, Adam [1 ,3 ]
机构
[1] Vanderbilt Univ, Med Ctr, Dept Biomed Informat, 2525 West End Ave 1475, Nashville, TN 37212 USA
[2] Vanderbilt Univ, Dept Comp Sci, Nashville, TN 37212 USA
[3] Vanderbilt Univ, Med Ctr, Dept Med, Nashville, TN 37212 USA
基金
美国国家卫生研究院;
关键词
large language model; retrieval augmented generation; systematic review; meta-analysis; BIAS;
D O I
10.1093/jamia/ocaf008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective The objectives of this study are to synthesize findings from recent research of retrieval-augmented generation (RAG) and large language models (LLMs) in biomedicine and provide clinical development guidelines to improve effectiveness.Materials and Methods We conducted a systematic literature review and a meta-analysis. The report was created in adherence to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses 2020 analysis. Searches were performed in 3 databases (PubMed, Embase, PsycINFO) using terms related to "retrieval augmented generation" and "large language model," for articles published in 2023 and 2024. We selected studies that compared baseline LLM performance with RAG performance. We developed a random-effect meta-analysis model, using odds ratio as the effect size.Results Among 335 studies, 20 were included in this literature review. The pooled effect size was 1.35, with a 95% confidence interval of 1.19-1.53, indicating a statistically significant effect (P = .001). We reported clinical tasks, baseline LLMs, retrieval sources and strategies, as well as evaluation methods.Discussion Building on our literature review, we developed Guidelines for Unified Implementation and Development of Enhanced LLM Applications with RAG in Clinical Settings to inform clinical applications using RAG.Conclusion Overall, RAG implementation showed a 1.35 odds ratio increase in performance compared to baseline LLMs. Future research should focus on (1) system-level enhancement: the combination of RAG and agent, (2) knowledge-level enhancement: deep integration of knowledge into LLM, and (3) integration-level enhancement: integrating RAG systems within electronic health records.
引用
收藏
页码:605 / 615
页数:11
相关论文
共 50 条
  • [21] Systematic review and meta-analysis protocol for development and validation of a prediction model for gestational hypertension in Africa
    Feleke, Sefineh Fenta
    Dessie, Anteneh Mengist
    Tenaw, Denekew
    Yimer, Ali
    Geremew, Habtamu
    Mulatie, Rahel
    Kebede, Abayneh
    SAGE OPEN MEDICINE, 2023, 11
  • [22] The efficacy of N-acetylcysteine in improving liver function: A systematic review and meta-analysis of controlled clinical trials
    Nikbaf-Shandiz, Mahlagha
    Adeli, Shaghayegh
    Faghfouri, Amir Hossein
    Khademi, Fateme
    Jamilian, Parsa
    Zarezadeh, Meysam
    Ebrahimi-Mamaghani, Mehrangiz
    PHARMANUTRITION, 2023, 24
  • [23] Effectiveness of early rhythm control in improving clinical outcomes in patients with atrial fibrillation: a systematic review and meta-analysis
    Zhu, Wengen
    Wu, Zexuan
    Dong, Yugang
    Lip, Gregory Y. H.
    Liu, Chen
    BMC MEDICINE, 2022, 20 (01)
  • [24] Effectiveness of early rhythm control in improving clinical outcomes in patients with atrial fibrillation: a systematic review and meta-analysis
    Wengen Zhu
    Zexuan Wu
    Yugang Dong
    Gregory Y. H. Lip
    Chen Liu
    BMC Medicine, 20
  • [25] Art therapy as an adjuvant treatment for schizophrenia: A protocol for an updated systematic review and subgroup meta-analysis of randomized clinical trials following the PRISMA guidelines
    Luo, Xuexing
    Zhang, Zheyu
    Zheng, Zhong
    Ye, Qian
    Wang, Jue
    Wu, Qibiao
    Huang, Guanghui
    MEDICINE, 2022, 101 (40) : E30935
  • [26] Uterine-preserving surgeries for the repair of pelvic organ prolapse: a systematic review with meta-analysis and clinical practice guidelines
    Meriwether, Kate V.
    Balk, Ethan M.
    Antosh, Danielle D.
    Olivera, Cedric K.
    Kim-Fine, Shunaha
    Murphy, Miles
    Grimes, Cara L.
    Sleemi, Ambereen
    Singh, Ruchira
    Dieter, Alexis A.
    Crisp, Catrina C.
    Rahn, David D.
    INTERNATIONAL UROGYNECOLOGY JOURNAL, 2019, 30 (04) : 505 - 522
  • [27] Uterine-preserving surgeries for the repair of pelvic organ prolapse: a systematic review with meta-analysis and clinical practice guidelines
    Kate V. Meriwether
    Ethan M. Balk
    Danielle D. Antosh
    Cedric K. Olivera
    Shunaha Kim-Fine
    Miles Murphy
    Cara L. Grimes
    Ambereen Sleemi
    Ruchira Singh
    Alexis A. Dieter
    Catrina C. Crisp
    David D. Rahn
    International Urogynecology Journal, 2019, 30 : 505 - 522
  • [28] Sensory substitution for orthopaedic gait rehabilitation: A systematic review and meta-analysis for clinical practice guideline development
    Lynch, Peter
    Broderick, Patrick
    Monaghan, Kenneth
    HELIYON, 2022, 8 (10)
  • [29] Endovascular thrombectomy for large ischemic strokes: An updated living systematic review and meta-analysis of randomized clinical trials
    Morsi, Rami Z.
    Elfil, Mohamed
    Ghaith, Hazem S.
    Aladawi, Mohammad
    Elmashad, Ahmed
    Kothari, Sachin
    Desai, Harsh
    Ghozy, Sherief
    Prabhakaran, Shyam
    Amuluru, Krishna
    Gandhi, Chirag D.
    Kass-Hout, Tareq
    Al-Mufti, Fawaz
    JOURNAL OF THE NEUROLOGICAL SCIENCES, 2024, 460
  • [30] Empirical Evidence of the Metacognitive Model of Rumination and Depression in Clinical and Nonclinical Samples: A Systematic Review and Meta-Analysis
    Julia B. Cano-López
    Esperanza García-Sancho
    Belén Fernández-Castilla
    José M. Salguero
    Cognitive Therapy and Research, 2022, 46 : 367 - 392