A Thorough Examination on Zero-shot Dense Retrieval

Authors
Ren, Ruiyang [1, 3]
Qu, Yingqi [2]
Liu, Jing [2]
Zhao, Wayne Xin [1, 3]
Wu, Qifei [2]
Ding, Yuchen [2]
Wu, Hua [2]
Wang, Haifeng [2]
Wen, Ji-Rong [1, 3]
Affiliations
[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
[2] Baidu Inc, Beijing, Peoples R China
[3] Beijing Key Lab Big Data Management & Anal Method, Beijing, Peoples R China
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recent years have witnessed significant advances in dense retrieval (DR) based on powerful pre-trained language models (PLMs). DR models have achieved excellent performance on several benchmark datasets, yet they have been shown to be less competitive than traditional sparse retrieval models (e.g., BM25) in a zero-shot retrieval setting. However, the related literature still lacks a detailed and comprehensive study of zero-shot retrieval. In this paper, we present the first thorough examination of the zero-shot capability of DR models. We aim to identify the key factors and analyze how they affect zero-shot retrieval performance. In particular, we discuss the effect of several key factors related to the source training set, analyze the potential bias from the target dataset, and review and compare existing zero-shot DR models. Our findings provide important evidence for better understanding and developing zero-shot DR models.
Pages: 15783-15796
Page count: 14