Enhancing Generative Retrieval with Reinforcement Learning from Relevance Feedback

被引:0
|
作者
Zhou, Yujia [1 ]
Dou, Zhicheng [2 ]
Wen, Ji-Rong [1 ]
机构
[1] Renmin Univ China, Sch Informat, Beijing, Peoples R China
[2] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent advent of end-to-end generative retrieval marks a significant shift in document retrieval methods, leveraging differentiable search indexes to directly produce relevant document identifiers (docids) in response to a specific query. Nevertheless, this approach faces two fundamental challenges: (i) a discrepancy between the token-level probabilistic optimization and the broader document-level relevance estimation; (ii) an overemphasis on top-1 results at the expense of overall ranking quality. To tackle these challenges, we propose a generative retrieval model with reinforcement learning from relevance feedback, which aims to align token-level docid generation with document-level relevance estimation. The training process incorporates three stages: supervised fine-tuning, relevance reward model training, and reinforced learning-to-rank from relevance feedback. To train a high-quality reward model, we define "relevance" under three progressive scenarios, which collectively offer a comprehensive evaluation of the document relevance. Experiments conducted on two benchmark datasets demonstrate the effectiveness of our proposed approach.
引用
收藏
页码:12481 / 12490
页数:10
相关论文
共 50 条
  • [1] Integrating relevance feedback techniques for image retrieval using reinforcement learning
    Yin, PY
    Bhanu, B
    Chang, KC
    Dong, A
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (10) : 1536 - 1551
  • [2] A Reinforcement Learning Framework for Relevance Feedback
    Montazeralghaem, Ali
    Zamani, Hamed
    Allan, James
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 59 - 68
  • [3] Learning region weighting from relevance feedback in image retrieval
    Jing, F
    Li, MJ
    Zhang, HJ
    Zhang, B
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4088 - 4091
  • [4] Relevance feedback with active learning for document retrieval
    Onoda, T
    Murata, H
    Yamada, S
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 1757 - 1762
  • [5] Content-Based Image Retrieval Based on Relevance Feedback and Reinforcement Learning for Medical Images
    Lakdashti, Abolfazl
    Ajorloo, Hossein
    ETRI JOURNAL, 2011, 33 (02) : 240 - 250
  • [6] Reinforcement learning for combining relevance feedback techniques
    Yin, PY
    Bhanu, B
    Chang, KC
    Dong, A
    NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, 2003, : 510 - 515
  • [7] Reinforcement learning using negative relevance feedback
    Son, Kijun
    Lee, JaeAn
    Park, SeongBae
    Lee, SangJo
    ALPIT 2007: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, 2007, : 559 - +
  • [8] Enhancing query translation with relevance feedback in translingual information retrieval
    He, Daqing
    Wu, Dan
    INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (01) : 1 - 17
  • [9] Enhancing relevance feedback in image retrieval using unlabeled data
    Zhou, Zhi-Hua
    Chen, Ke-Jia
    Dai, Hong-Bin
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2006, 24 (02) : 219 - 244
  • [10] Learning Weighted Distances for Relevance Feedback in Image Retrieval
    Deselaers, Thomas
    Paredes, Roberto
    Vidal, Enrique
    Ney, Hermann
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2100 - 2103