BiomedRAG: A retrieval augmented large language model for biomedicine

被引:8
作者
Li, Mingchen [1 ]
Kilicoglu, Halil [2 ]
Xu, Hua [3 ]
Zhang, Rui [1 ]
机构
[1] Univ Minnesota, Dept Surg, Div Computat Hlth Sci, Minneapolis, MN 55455 USA
[2] Univ Illinois, Sch Informat Sci, Champaign, IL USA
[3] Yale Univ, Sch Med, Dept Biomed Informat & Data Sci, New Haven, CT USA
基金
美国国家卫生研究院;
关键词
Retrieval-augmented generation; Large language model; EXTRACTION;
D O I
10.1016/j.jbi.2024.104769
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Retrieval-augmented generation (RAG) involves a solution by retrieving knowledge from an established database to enhance the performance of large language models (LLM). , these models retrieve information at the sentence or paragraph level, potentially introducing noise and affecting the generation quality. To address these issues, we propose a novel BiomedRAG framework that directly feeds automatically retrieved chunk-based documents into the LLM. Our evaluation of BiomedRAG across four biomedical natural language processing tasks using eight datasets demonstrates that our proposed framework not only improves the performance by 9.95% on average, but also achieves state-of-the-art results, surpassing various baselines by 4.97%. BiomedRAG paves the way for more accurate and adaptable LLM applications in the biomedical domain.
引用
收藏
页数:11
相关论文
共 50 条
[31]   Large Language Model Powered Agents for Information Retrieval [J].
Zhang, An ;
Deng, Yang ;
Lin, Yankai ;
Chen, Xu ;
Wen, Ji-Rong ;
Chua, Tat-Seng .
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, :2989-2992
[32]   RAMIE: retrieval-augmented multi-task information extraction with large language models on dietary supplements [J].
Zhan, Zaifu ;
Zhou, Shuang ;
Li, Mingchen ;
Zhang, Rui .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2025, 32 (03) :545-554
[33]   Combining Lexicon Definitions and the Retrieval-Augmented Generation of a Large Language Model for the Automatic Annotation of Ancient Chinese Poetry [J].
Li, Jiabin ;
Wei, Tingxin ;
Qu, Weiguang ;
Li, Bin ;
Feng, Minxuan ;
Wang, Dongbo .
MATHEMATICS, 2025, 13 (12)
[34]   LARGE LANGUAGE MODELS (LLMS) AND CHATGPT FOR BIOMEDICINE [J].
Arighi, Cecilia ;
Brenner, Steven ;
Lu, Zhiyong .
BIOCOMPUTING 2024, PSB 2024, 2024, :641-644
[35]   Intelligent Orientation Robot Based on Large Language Models and Retrieval-Augmented Generation [J].
Deng, Xinying ;
Yang, Dongju ;
Zhang, Yuan .
2024 5TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTER ENGINEERING, ICAICE, 2024, :779-782
[36]   ANDROMEDA: Debugging Database Performance Issues with Retrieval-Augmented Large Language Models [J].
Wang, Pengyi ;
Chen, Sibei ;
Fan, Ju ;
Wu, Bin ;
Tang, Nan ;
Tan, Jian .
COMPANION OF THE 2025 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, SIGMOD-COMPANION 2025, 2025, :243-246
[37]   Enhancing textual textbook question answering with large language models and retrieval augmented generation [J].
Alawwad, Hessa A. ;
Alhothali, Areej ;
Naseem, Usman ;
Alkhathlan, Ali ;
Jamal, Amani .
PATTERN RECOGNITION, 2025, 162
[38]   ReAPR: Automatic program repair via retrieval-augmented large language models [J].
Liu, Zixin ;
Du, Xiaozhi ;
Liu, Hairui .
SOFTWARE QUALITY JOURNAL, 2025, 33 (03)
[39]   MMRAG: multi-mode retrieval-augmented generation with large language models for biomedical in-context learning [J].
Zhan, Zaifu ;
Wang, Jun ;
Zhou, Shuang ;
Deng, Jiawen ;
Zhang, Rui .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2025,
[40]   Answering real-world clinical questions using large language model, retrieval-augmented generation, and agentic systems [J].
Low, Yen Sia ;
Jackson, Michael L. ;
Hyde, Rebecca J. ;
Brown, Robert E. ;
Sanghavi, Neil M. ;
Baldwin, Julian D. ;
Pike, C. William ;
Muralidharan, Jananee ;
Hui, Gavin ;
Alexander, Natasha ;
Hassan, Hadeel ;
Nene, Rahul, V ;
Pike, Morgan ;
Pokrzywa, Courtney J. ;
Vedak, Shivam ;
Yan, Adam Paul ;
Yao, Dong-han ;
Zipursky, Amy R. ;
Dinh, Christina ;
Ballentine, Philip ;
Derieg, Dan C. ;
Polony, Vladimir ;
Chawdry, Rehan N. ;
Davies, Jordan ;
Hyde, Brigham B. ;
Shah, Nigam H. ;
Gombar, Saurabh .
DIGITAL HEALTH, 2025, 11