"HOW ROBUST R U?": EVALUATING TASK-ORIENTED DIALOGUE SYSTEMS ON SPOKEN CONVERSATIONS

被引:3
|
作者
Kim, Seokhwan [1 ]
Liu, Yang [1 ]
Fin, Di [1 ]
Papangelis, Alexandros [1 ]
Gopalakrishnan, Karthik [1 ]
Hedayatnia, Behnam [1 ]
Hakkani-Tur, Dilek [1 ]
机构
[1] Amazon Alexa AI, Sunnyvale, CA 94089 USA
来源
2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU) | 2021年
关键词
spoken dialogue systems; dialogue state tracking; knowledge-grounded dialogue generation; NETWORKS;
D O I
10.1109/ASRU51503.2021.9688274
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most prior work in dialogue modeling has been on written conversations mostly because of existing data sets. However, written dialogues are not sufficient to fully capture the nature of spoken conversations as well as the potential speech recognition errors in practical spoken dialogue systems. This work presents a new benchmark on spoken task-oriented conversations, which is intended to study multi-domain dialogue state tracking and knowledge-grounded dialogue modeling. We report that the existing state-of-the-art models trained on written conversations are not performing well on our spoken data, as expected. Furthermore, we observe improvements in task performances when leveraging n-best speech recognition hypotheses such as by combining predictions based on individual hypotheses. Our data set enables speech-based bench-marking of task-oriented dialogue systems.
引用
收藏
页码:1147 / 1154
页数:8
相关论文
共 16 条
  • [1] Robust Cross-lingual Task-oriented Dialogue
    Xiang, Lu
    Zhu, Junnan
    Zhao, Yang
    Zhou, Yu
    Zong, Chengqing
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (06)
  • [2] Model discrepancy policy optimization for task-oriented dialogue
    Zhou, Zhenyou
    Liu, Zhibin
    Dong, Zhaoan
    Liu, Yuhan
    COMPUTER SPEECH AND LANGUAGE, 2024, 87
  • [3] LEARNING CONCEPTS THROUGH CONVERSATIONS IN SPOKEN DIALOGUE SYSTEMS
    Jia, Robin
    Heck, Larry
    Hakkani-Tur, Dilek
    Nikolov, Georgi
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5725 - 5729
  • [4] Towards Universal Dialogue Act Tagging for Task-Oriented Dialogues
    Paul, Shachi
    Goel, Rahul
    Hakkani-Tur, Dilek
    INTERSPEECH 2019, 2019, : 1453 - 1457
  • [5] MultiWOZ-PT: A Task-oriented Dialogue Dataset in Portuguese
    Ferreira, Patricia
    Pais, Francisco
    Silva, Catarina
    Alves, Ana
    Oliveira, Hugo Goncalo
    LINGUAMATICA, 2024, 16 (02):
  • [6] Discriminative Transfer Learning for Optimizing ASR and Semantic Labeling in Task-oriented Spoken Dialog
    Qian, Yao
    Shi, Yu
    Zeng, Michael
    INTERSPEECH 2020, 2020, : 3915 - 3919
  • [7] Towards a Flexible User Simulation for Evaluating Spoken Dialogue Systems
    Butenkov, Dmitry
    HUMAN-COMPUTER INTERACTION - INTERACT 2009, PT II, PROCEEDINGS, 2009, 5727 : 880 - 883
  • [8] Statistical Methods for Building Robust Spoken Dialogue Systems in an Automobile
    Tsiakoulis, Pirros
    Gasic, Milica
    Henderson, Matthew
    Planells-Lerma, Joaquin
    Prombonas, Jorge
    Thomson, Blaise
    Yu, Kai
    Young, Steve
    Tzirkel, Eli
    ADVANCES IN HUMAN ASPECTS OF ROAD AND RAIL TRANSPORTATION, 2013, : 744 - 753
  • [9] OSTOD: One-Step Task-Oriented Dialogue with activated state and retelling response
    Huang, Heyan
    Yang, Puhai
    Wei, Wei
    Shi, Shumin
    Mao, Xian-Ling
    KNOWLEDGE-BASED SYSTEMS, 2024, 293
  • [10] A Reinforcement Learning approach to evaluating state representations in spoken dialogue systems
    Tetreault, Joel R.
    Litman, Diane J.
    SPEECH COMMUNICATION, 2008, 50 (8-9) : 683 - 696