Enhancing knowledge retrieval with in-context learning and semantic search through generative AI

被引:0
|
作者
Ghali, Mohammed-Khalil [1 ]
Farrag, Abdelrahman [1 ]
Won, Daehan [1 ]
Jin, Yu [2 ]
机构
[1] SUNY Binghamton, Sch Syst Sci & Ind Engn, 4400 Vestal Pkwy, Binghamton, NY 13902 USA
[2] SUNY Buffalo, Dept Ind & Syst Engn, 210 Mary Talbert Way, Buffalo, NY 14260 USA
关键词
Generative AI; LLMs; In-context learning; Semantic search;
D O I
10.1016/j.knosys.2025.113047
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Retrieving and extracting knowledge from sets of many complex research documents and large databases presents significant challenges in today's information-rich era. Existing retrieval systems, which rely on generalpurpose Large Language Models (LLMs), often fail to provide accurate responses to domain-specific inquiries. Additionally, the high cost of pretraining or finetuning LLMs for specific domains limits their adoption. To address those limitations, a novel methodology is proposed that combines the generative capabilities of LLMs with the fast and accurate retrieval capabilities of vector databases. This retrieval system can handle tabular and non-tabular data, understand natural language queries, and retrieve relevant information without finetuning. The developed model, Generative Text Retrieval (GTR), is adaptable to unstructured and structured data with minor refinement. GTR was evaluated on manually annotated and public datasets, achieving more than 90% accuracy and delivering truthful outputs in 87% of cases. The proposed model achieved state-ofthe-art performance with a Rouge-L F1 score of 0.98 on the MSMARCO dataset. A refined model, Generative Tabular Text Retrieval (GTR-T), demonstrated its efficiency in large database querying, achieving an Execution Accuracy (EX) of 0.82 and an Exact-Set-Match (EM) accuracy of 0.60 on the Spider dataset, using open-source LLM. Those efforts leverage generative AI and in-context learning to enhance human-text interaction.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Generative Calibration for In-context Learning
    Jiang, Zhongtao
    Zhang, Yuanzhe
    Liu, Cao
    Zhao, Jun
    Liu, Kang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 2312 - 2333
  • [2] Enhancing Reasoning Ability in Semantic Communication Through Generative AI-Assisted Knowledge Construction
    Zhao, Fangzhou
    Sun, Yao
    Feng, Lei
    Zhang, Lan
    Zhao, Dezong
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (04) : 832 - 836
  • [3] Identifying Semantic Induction Heads to Understand In-Context Learning
    Ren, Jie
    Guo, Qipeng
    Yan, Hang
    Liu, Dongrui
    Zhang, Quanshi
    Qiu, Xipeng
    Lin, Dahua
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 6916 - 6932
  • [4] Improving Grapheme-to-Phoneme Conversion through In-Context Knowledge Retrieval with Large Language Models
    Han, Dongrui
    Cui, Mingyu
    Kang, Jiawen
    Wu, Xixin
    Liu, Xunying
    Meng, Helen
    2024 IEEE 14TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, ISCSLP 2024, 2024, : 631 - 635
  • [5] Enhancing In-context Learning via Linear Probe Calibration
    Abbas, Momin
    Zhou, Yi
    Ram, Parikshit
    Baracaldo, Nathalie
    Samulowitz, Horst
    Salonidis, Theodoros
    Chen, Tianyi
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [6] Generative AI as a Service in 6G Edge-Cloud: Generation Task Offloading by In-Context Learning
    Zhou, Hao
    Hu, Chengming
    Yuan, Dun
    Yuan, Ye
    Wu, Di
    Liu, Xue
    Han, Zhu
    Zhang, Jianzhong
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2025, 14 (03) : 711 - 715
  • [7] A Generative Context Model for Semantic Music Annotation and Retrieval
    Miotto, Riccardo
    Lanckriet, Gert
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04): : 1096 - 1108
  • [8] Enhancing Chinese Address Parsing in Low-Resource Scenarios through In-Context Learning
    Ling, Guangming
    Mu, Xiaofeng
    Wang, Chao
    Xu, Aiping
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2023, 12 (07)
  • [9] Exploring In-Context Learning for Knowledge Grounded Dialog Generation
    Chen, Qinyu
    Wu, Wenhao
    Li, Sujian
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 10071 - 10081
  • [10] Can We Edit Factual Knowledge by In-Context Learning?
    Zheng, Ce
    Li, Lei
    Dong, Qingxiu
    Fan, Yuxuan
    Wu, Zhiyong
    Xu, Jingjing
    Chang, Baobao
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 4862 - 4876