"HOW ROBUST R U?": EVALUATING TASK-ORIENTED DIALOGUE SYSTEMS ON SPOKEN CONVERSATIONS

Cited by: 3

Authors:

Kim, Seokhwan [1]

Liu, Yang [1]

Jin, Di [1]

Papangelis, Alexandros [1]

Gopalakrishnan, Karthik [1]

Hedayatnia, Behnam [1]

Hakkani-Tur, Dilek [1]

Affiliations:

[1] Amazon Alexa AI, Sunnyvale, CA 94089 USA

Source:

2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU) | 2021

Keywords:

spoken dialogue systems; dialogue state tracking; knowledge-grounded dialogue generation; NETWORKS;

DOI:

10.1109/ASRU51503.2021.9688274

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Most prior work in dialogue modeling has been on written conversations mostly because of existing data sets. However, written dialogues are not sufficient to fully capture the nature of spoken conversations as well as the potential speech recognition errors in practical spoken dialogue systems. This work presents a new benchmark on spoken task-oriented conversations, which is intended to study multi-domain dialogue state tracking and knowledge-grounded dialogue modeling. We report that the existing state-of-the-art models trained on written conversations are not performing well on our spoken data, as expected. Furthermore, we observe improvements in task performances when leveraging n-best speech recognition hypotheses such as by combining predictions based on individual hypotheses. Our data set enables speech-based benchmarking of task-oriented dialogue systems.

Pages: 1147-1154

Number of pages: 8

16 items in total

[1] Robust Cross-lingual Task-oriented Dialogue
Xiang, Lu
Zhu, Junnan
Zhao, Yang
Zhou, Yu
Zong, Chengqing
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (06)
[2] Model discrepancy policy optimization for task-oriented dialogue
Zhou, Zhenyou
Liu, Zhibin
Dong, Zhaoan
Liu, Yuhan
COMPUTER SPEECH AND LANGUAGE, 2024, 87
[3] LEARNING CONCEPTS THROUGH CONVERSATIONS IN SPOKEN DIALOGUE SYSTEMS
Jia, Robin
Heck, Larry
Hakkani-Tur, Dilek
Nikolov, Georgi
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017: 5725 - 5729
[4] Towards Universal Dialogue Act Tagging for Task-Oriented Dialogues
Paul, Shachi
Goel, Rahul
Hakkani-Tur, Dilek
INTERSPEECH 2019, 2019: 1453 - 1457
[5] MultiWOZ-PT: A Task-oriented Dialogue Dataset in Portuguese
Ferreira, Patricia
Pais, Francisco
Silva, Catarina
Alves, Ana
Oliveira, Hugo Goncalo
LINGUAMATICA, 2024, 16 (02)
[6] Discriminative Transfer Learning for Optimizing ASR and Semantic Labeling in Task-oriented Spoken Dialog
Qian, Yao
Shi, Yu
Zeng, Michael
INTERSPEECH 2020, 2020: 3915 - 3919
[7] Towards a Flexible User Simulation for Evaluating Spoken Dialogue Systems
Butenkov, Dmitry
HUMAN-COMPUTER INTERACTION - INTERACT 2009, PT II, PROCEEDINGS, 2009, 5727 : 880 - 883
[8] Statistical Methods for Building Robust Spoken Dialogue Systems in an Automobile
Tsiakoulis, Pirros
Gasic, Milica
Henderson, Matthew
Planells-Lerma, Joaquin
Prombonas, Jorge
Thomson, Blaise
Yu, Kai
Young, Steve
Tzirkel, Eli
ADVANCES IN HUMAN ASPECTS OF ROAD AND RAIL TRANSPORTATION, 2013: 744 - 753
[9] OSTOD: One-Step Task-Oriented Dialogue with activated state and retelling response
Huang, Heyan
Yang, Puhai
Wei, Wei
Shi, Shumin
Mao, Xian-Ling
KNOWLEDGE-BASED SYSTEMS, 2024, 293
[10] A Reinforcement Learning approach to evaluating state representations in spoken dialogue systems
Tetreault, Joel R.
Litman, Diane J.
SPEECH COMMUNICATION, 2008, 50 (8-9) : 683 - 696
