Unsupervised Question Clarity Prediction Through Retrieved Item Coherency

被引：11

作者：

Arabzadeh, Negar ^{[1
]}

Seifikar, Mahsa ^{[1
]}

Clarke, Charles L. A. ^{[1
]}

机构：

[1] Univ Waterloo, Waterloo, ON, Canada

来源：

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022 | 2022年

关键词：

Ambiguous Queries; Clarifying Questions; Retrieval Coherency;

D O I：

10.1145/3511808.3557719

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Despite recent progress on conversational systems, they still do not perform smoothly when faced with ambiguous requests. When questions are unclear, conversational systems should have the ability to ask clarifying questions, rather than assuming a particular interpretation or simply responding that they do not understand. While the research community has paid substantial attention to the problem of predicting query ambiguity in traditional search contexts, researchers have paid relatively little attention to predicting when this ambiguity is sufficient to warrant clarification in the context of conversational systems. In this paper, we propose an unsupervised method for predicting the need for clarification. This method is based on the measured coherency of results from an initial answer retrieval step, under the assumption that a less ambiguous query is more likely to retrieve more coherent results when compared to an ambiguous query. We build a graph from retrieved items based on their context similarity, treating measures of graph connectivity as indicators of ambiguity. We evaluate our approach on two open-domain conversational question answering datasets, ClariQ and AmbigNQ, comparing it with neural and non-neural baselines. Our unsupervised approach performs as well as supervised approaches while providing better generalization.

引用

页码：3811 / 3816

页数：6

共 52 条

[31] Lin Jimmy, 2021, Synthesis Lectures on Human Language Technologies, V14, P1
[32] How Am I Doing?: Evaluating Conversational Search Systems Offline
Lipani, Aldo
Carterette, Ben
Yilmaz, Emine
[J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2021, 39 (04)
[33] CASA: Correlation-Aware Speculative Adders
Liu, Gai
Tao, Ye
Tan, Mingxing
Zhang, Zhiru
[J]. PROCEEDINGS OF THE 2014 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2014, : 189 - 194
[34] Liu Y., 2020, RoBERTa: a robustly optimized BERT pretraining approach
[35] How Deep is your Learning: the DL-HARD Annotated Deep Learning Dataset
Mackie, Iain
Dalton, Jeffrey
Yates, Andrew
[J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2335 - 2341
[36] Min Sewon, 2020, ARXIV200410645
[37] Mitra B., 2018, An introduction to neural information
[38] Machines and mindlessness: Social responses to computers
Nass, C
Moon, Y
[J]. JOURNAL OF SOCIAL ISSUES, 2000, 56 (01) : 81 - 103
[39] Putra Jan Wira Gotama, 2017, Proceedings of TextGraphs-11: the workshop on graph-based methods for natural language processing, P76, DOI DOI 10.18653/V1/W17-2410
[40] Roul Rajendra Kumar, 2012, ARXIV12041406

← 1 2 3 4 5 6 →