Few-Shot Conversational Dense Retrieval

被引:54
作者
Yu, Shi [1 ]
Liu, Zhenghao [1 ]
Xiong, Chenyan [2 ]
Feng, Tao [1 ]
Liu, Zhiyuan [1 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Microsoft Res, Beijing, Peoples R China
来源
SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 2021年
关键词
Conversational Retrieval; Dense Retrieval; Knowledge Distillation;
D O I
10.1145/3404835.3462856
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dense retrieval (DR) has the potential to resolve the query understanding challenge in conversational search by matching in the learned embedding space. However, this adaptation is challenging due to DR models' extra needs for supervision signals and the longtail nature of conversational search. In this paper, we present a Conversational Dense Retrieval system, ConvDR, that learns contextualized embeddings for multi-turn conversational queries and retrieves documents solely using embedding dot products. In addition, we grant ConvDR few-shot ability using a teacher-student framework, where we employ an ad hoc dense retriever as the teacher, inherit its document encodings, and learn a student query encoder to mimic the teacher embeddings on oracle reformulated queries. Our experiments on TREC CAsT and OR-QuAC demonstrate ConvDR's effectiveness in both few-shot and fully-supervised settings. It outperforms previous systems that operate in the sparse word space, matches the retrieval accuracy of oracle query reformulations, and is also more efficient thanks to its simplicity. Our analyses reveal that the advantages of ConvDR come from its ability to capture informative context while ignoring the unrelated context in previous conversation rounds. This makes ConvDR more effective as conversations evolve while previous systems may get confused by the increased noise from previous turns.
引用
收藏
页码:829 / 838
页数:10
相关论文
共 48 条
  • [1] Asking Clarifying Questions in Open-Domain Information-Seeking Conversations
    Aliannejadi, Mohammad
    Zamani, Hamed
    Crestani, Fabio
    Croft, W. Bruce
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 475 - 484
  • [2] [Anonymous], 2010, SEARCH ENGINES INFOR
  • [3] Bajaj P., 2016, ARXIV161109268CS
  • [4] Chen Qu Liu, 2020, P 43 INT ACM SIGIR C, P539, DOI DOI 10.1145/3397271.3401110
  • [5] Choi Eunsol, 2020, P EMNLP, P2174
  • [6] Reciprocal Rank Fusion outperforms Condorcet and Individual Rank Learning Methods
    Cormack, Gordon V.
    Clarke, Charles L. A.
    Buettcher, Stefan
    [J]. PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 758 - 759
  • [7] The Importance of Interaction for Information Retrieval
    Croft, W. Bruce
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1 - 1
  • [8] Culpepper J. S., 2018, ACM SIGIR FORUM, V52, P34
  • [9] Dalton Jeffrey, 2019, P TREC
  • [10] Dalton Jeffrey, 2020, P 43 INT ACM SIG C, P1985, DOI DOI 10.1145/1122445.1122456