Unsupervised Question Clarity Prediction Through Retrieved Item Coherency

被引:11
作者
Arabzadeh, Negar [1 ]
Seifikar, Mahsa [1 ]
Clarke, Charles L. A. [1 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022 | 2022年
关键词
Ambiguous Queries; Clarifying Questions; Retrieval Coherency;
D O I
10.1145/3511808.3557719
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite recent progress on conversational systems, they still do not perform smoothly when faced with ambiguous requests. When questions are unclear, conversational systems should have the ability to ask clarifying questions, rather than assuming a particular interpretation or simply responding that they do not understand. While the research community has paid substantial attention to the problem of predicting query ambiguity in traditional search contexts, researchers have paid relatively little attention to predicting when this ambiguity is sufficient to warrant clarification in the context of conversational systems. In this paper, we propose an unsupervised method for predicting the need for clarification. This method is based on the measured coherency of results from an initial answer retrieval step, under the assumption that a less ambiguous query is more likely to retrieve more coherent results when compared to an ambiguous query. We build a graph from retrieved items based on their context similarity, treating measures of graph connectivity as indicators of ambiguity. We evaluate our approach on two open-domain conversational question answering datasets, ClariQ and AmbigNQ, comparing it with neural and non-neural baselines. Our unsupervised approach performs as well as supervised approaches while providing better generalization.
引用
收藏
页码:3811 / 3816
页数:6
相关论文
共 52 条
  • [31] Lin Jimmy, 2021, Synthesis Lectures on Human Language Technologies, V14, P1
  • [32] How Am I Doing?: Evaluating Conversational Search Systems Offline
    Lipani, Aldo
    Carterette, Ben
    Yilmaz, Emine
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2021, 39 (04)
  • [33] CASA: Correlation-Aware Speculative Adders
    Liu, Gai
    Tao, Ye
    Tan, Mingxing
    Zhang, Zhiru
    [J]. PROCEEDINGS OF THE 2014 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2014, : 189 - 194
  • [34] Liu Y., 2020, RoBERTa: a robustly optimized BERT pretraining approach
  • [35] How Deep is your Learning: the DL-HARD Annotated Deep Learning Dataset
    Mackie, Iain
    Dalton, Jeffrey
    Yates, Andrew
    [J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2335 - 2341
  • [36] Min Sewon, 2020, ARXIV200410645
  • [37] Mitra B., 2018, An introduction to neural information
  • [38] Machines and mindlessness: Social responses to computers
    Nass, C
    Moon, Y
    [J]. JOURNAL OF SOCIAL ISSUES, 2000, 56 (01) : 81 - 103
  • [39] Putra Jan Wira Gotama, 2017, Proceedings of TextGraphs-11: the workshop on graph-based methods for natural language processing, P76, DOI DOI 10.18653/V1/W17-2410
  • [40] Roul Rajendra Kumar, 2012, ARXIV12041406