Unsupervised Question Clarity Prediction Through Retrieved Item Coherency

被引:11
作者
Arabzadeh, Negar [1 ]
Seifikar, Mahsa [1 ]
Clarke, Charles L. A. [1 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022 | 2022年
关键词
Ambiguous Queries; Clarifying Questions; Retrieval Coherency;
D O I
10.1145/3511808.3557719
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite recent progress on conversational systems, they still do not perform smoothly when faced with ambiguous requests. When questions are unclear, conversational systems should have the ability to ask clarifying questions, rather than assuming a particular interpretation or simply responding that they do not understand. While the research community has paid substantial attention to the problem of predicting query ambiguity in traditional search contexts, researchers have paid relatively little attention to predicting when this ambiguity is sufficient to warrant clarification in the context of conversational systems. In this paper, we propose an unsupervised method for predicting the need for clarification. This method is based on the measured coherency of results from an initial answer retrieval step, under the assumption that a less ambiguous query is more likely to retrieve more coherent results when compared to an ambiguous query. We build a graph from retrieved items based on their context similarity, treating measures of graph connectivity as indicators of ambiguity. We evaluate our approach on two open-domain conversational question answering datasets, ClariQ and AmbigNQ, comparing it with neural and non-neural baselines. Our unsupervised approach performs as well as supervised approaches while providing better generalization.
引用
收藏
页码:3811 / 3816
页数:6
相关论文
共 52 条
  • [1] Asking Clarifying Questions in Open-Domain Information-Seeking Conversations
    Aliannejadi, Mohammad
    Zamani, Hamed
    Crestani, Fabio
    Croft, W. Bruce
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 475 - 484
  • [2] Aliannejadi Mohammad, 2021, ARXIV210905794
  • [3] [Anonymous], 2008, P 31 ANN INT ACM SIG, DOI DOI 10.1145/1390334.1390420
  • [4] [Anonymous], 2018, ARXIV180504655
  • [5] [Anonymous], 2017, RANLP
  • [6] Arabzadeh Negar, 2021, Advances in Information Retrieval. 43rd European Conference on IR Research, ECIR 2021. Proceedings. Lecture Notes in Computer Science (LNCS 12657), P193, DOI 10.1007/978-3-030-72240-1_15
  • [7] MS MARCO Chameleons: Challenging the MS MARCO Leaderboard with Extremely Obstinate Queries
    Arabzadeh, Negar
    Mitra, Bhaskar
    Bagheri, Ebrahim
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4426 - 4435
  • [8] BERT-QPP: Contextualized Pre-trained Transformers for Query Performance Prediction
    Arabzadeh, Negar
    Khodabakhsh, Maryam
    Bagheri, Ebrahim
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2857 - 2861
  • [9] Geometric Estimation of Specificity within Embedding Spaces
    Arabzadeh, Negar
    Zarrinkalam, Fattane
    Jovanovic, Jelena
    Bagheri, Ebrahim
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2109 - 2112
  • [10] Neural embedding-based specificity metrics for pre-retrieval query performance prediction
    Arabzadeh, Negar
    Zarrinkalam, Fattane
    Jovanovic, Jelena
    Al-Obeidat, Feras
    Bagheri, Ebrahim
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (04)