Performance comparison of retrieval-augmented generation and fine-tuned large language models for construction safety management knowledge retrieval

Cited by: 3
Authors
Lee, Jungwon [1]
Ahn, Seungjun [1]
Kim, Daeho [2]
Kim, Dongkyun [1]
Affiliations
[1] Hongik Univ, Dept Civil & Environm Engn, Seoul, South Korea
[2] Univ Toronto, Dept Civil & Mineral Engn, Toronto, ON, Canada
Funding
National Research Foundation, Singapore
Keywords
Large Language Model (LLM); Retrieval-Augmented Generation (RAG); Fine-tuned LLM; Construction safety; Knowledge graph;
DOI
10.1016/j.autcon.2024.105846
Chinese Library Classification (CLC) number
TU [Building Science]
Discipline classification code
0813
Abstract
Construction safety standards are in unstructured formats such as text and images, which complicates their effective use in daily tasks. This paper compares the performance of Retrieval-Augmented Generation (RAG) and a fine-tuned Large Language Model (LLM) for construction safety knowledge retrieval. The RAG model was created by integrating GPT-4 with a knowledge graph derived from construction safety guidelines, while the fine-tuned LLM was trained on a question-answering dataset derived from the same guidelines. The models' performance was tested through case studies in which accident synopses served as queries to generate preventive measures. The responses were assessed using metrics including cosine similarity, Euclidean distance, BLEU, and ROUGE scores. Both models outperformed GPT-4, with the RAG model improving by 21.5% and the fine-tuned LLM by 26%. The findings highlight the relative strengths and weaknesses of the RAG and fine-tuned LLM approaches in terms of applicability and reliability for safety management.
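As a rough illustration of the four metric families named in the abstract, the following Python sketch scores one generated answer against one reference answer. The abstract does not say how the responses were vectorized or which BLEU and ROUGE variants were used, so everything here is an assumption made for illustration: bag-of-words count vectors stand in for whatever text representation the authors used, the BLEU-style score is unigram precision without a brevity penalty, the ROUGE-style score is ROUGE-1 recall, and the example sentences and all function names are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (an assumption, not the paper's method)."""
    return text.lower().split()


def bag_of_words(tokens, vocab):
    """Count vector of `tokens` over a fixed, ordered vocabulary."""
    counts = Counter(tokens)
    return [counts[word] for word in vocab]


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def euclidean_distance(u, v):
    """Straight-line distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def clipped_overlap(candidate, reference):
    """Number of candidate tokens that also occur in the reference (clipped counts)."""
    return sum((Counter(candidate) & Counter(reference)).values())


def unigram_precision(candidate, reference):
    """BLEU-1 style precision, without brevity penalty (a simplification)."""
    return clipped_overlap(candidate, reference) / len(candidate) if candidate else 0.0


def rouge1_recall(candidate, reference):
    """ROUGE-1 recall: share of reference tokens recovered by the candidate."""
    return clipped_overlap(candidate, reference) / len(reference) if reference else 0.0


# Hypothetical reference and model answer, not taken from the paper.
reference = tokenize("workers must wear a safety harness")
candidate = tokenize("workers must wear a harness")

vocab = sorted(set(reference) | set(candidate))
ref_vec = bag_of_words(reference, vocab)
cand_vec = bag_of_words(candidate, vocab)

scores = {
    "cosine": cosine_similarity(cand_vec, ref_vec),
    "euclidean": euclidean_distance(cand_vec, ref_vec),
    "bleu1_precision": unigram_precision(candidate, reference),
    "rouge1_recall": rouge1_recall(candidate, reference),
}
```

Higher cosine, BLEU, and ROUGE values and a lower Euclidean distance indicate a generated answer closer to the reference. A full evaluation would more plausibly use sentence embeddings and standard BLEU/ROUGE implementations, but the abstract does not specify either.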
Pages: 12