Testing the performance of spoken dialogue systems by means of an artificially simulated user

被引:18
作者
Lopez-Cozar, Ramon [1 ]
Callejas, Zoraida [1 ]
McTear, Michael [2 ]
机构
[1] Univ Granada, Dept Languages & Comp Syst, Fac Comp Sci, E-18071 Granada, Spain
[2] Univ Ulster, Sch Comp & Math, Newtownabbey, North Ireland
关键词
spoken dialogue systems; speech recognition; speech understanding; user simulation; artificial intelligence; natural language processing; robust human-computer interaction;
D O I
10.1007/s10462-007-9059-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new technique to test the performance of spoken dialogue systems by artificially simulating the behaviour of three types of user (very cooperative, cooperative and not very cooperative) interacting with a system by means of spoken dialogues. Experiments using the technique were carried out to test the performance of a previously developed dialogue system designed for the fast-food domain and working with two kinds of language model for automatic speech recognition: one based on 17 prompt-dependent language models, and the other based on one prompt-independent language model. The use of the simulated user enables the identification of problems relating to the speech recognition, spoken language understanding, and dialogue management components of the system. In particular, in these experiments problems were encountered with the recognition and understanding of postal codes and addresses and with the lengthy sequences of repetitive confirmation turns required to correct these errors. By employing a simulated user in a range of different experimental conditions sufficient data can be generated to support a systematic analysis of potential problems and to enable fine-grained tuning of the system.
引用
收藏
页码:291 / 323
页数:33
相关论文
共 50 条
  • [41] Online Learning of Attributed Bi-Automata for Dialogue Management in Spoken Dialogue Systems
    Serras, Manex
    Ines Torres, Maria
    Del Pozo, Arantza
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2017), 2017, 10255 : 22 - 31
  • [42] A Toolkit for the Evaluation of Spoken Dialogue Systems in Ambient Intelligence Domains
    Abalos, Nieves
    Espejo, Gonzalo
    Lopez-Cozar, Ramon
    Callejas, Zoraida
    Griol, David
    [J]. WORKSHOP PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT ENVIRONMENTS, 2011, 10 : 384 - 394
  • [43] Incorporating discourse features into confidence scoring of intention recognition results in spoken dialogue systems
    Higashinaka, R
    Sudoh, K
    Nakano, M
    [J]. SPEECH COMMUNICATION, 2006, 48 (3-4) : 417 - 436
  • [44] Root Cause Analysis of Miscommunication Hotspots in Spoken Dialogue Systems
    Georgiladakis, Spiros
    Athanasopoulou, Georgia
    Meena, Raveesh
    Lopes, Jose
    Chorianopoulou, Arodami
    Palogiannidi, Elisavet
    Iosif, Elias
    Skantze, Gabriel
    Potamianos, Alexandros
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1156 - 1160
  • [45] Application of Hidden Topic Markov Models on Spoken Dialogue Systems
    Chinaei, Hamid R.
    Chaib-draa, Brahim
    Lamontagne, Luc
    [J]. AGENTS AND ARTIFICIAL INTELLIGENCE, 2010, 67 : 151 - 163
  • [46] Reinforcement learning for parameter estimation in statistical spoken dialogue systems
    Jurcicek, Filip
    Thomson, Blaise
    Young, Steve
    [J]. COMPUTER SPEECH AND LANGUAGE, 2012, 26 (03) : 168 - 192
  • [47] Detecting Repetitions in Spoken Dialogue Systems Using Phonetic Distances
    Lopes, Jose
    Salvi, Giampiero
    Skantze, Gabriel
    Abad, Alberto
    Gustafson, Joakim
    Batista, Fernando
    Meena, Raveesh
    Trancoso, Isabel
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1805 - 1809
  • [48] Statistical Methods for Building Robust Spoken Dialogue Systems in an Automobile
    Tsiakoulis, Pirros
    Gasic, Milica
    Henderson, Matthew
    Planells-Lerma, Joaquin
    Prombonas, Jorge
    Thomson, Blaise
    Yu, Kai
    Young, Steve
    Tzirkel, Eli
    [J]. ADVANCES IN HUMAN ASPECTS OF ROAD AND RAIL TRANSPORTATION, 2013, : 744 - 753
  • [49] Expanding Vocabulary for Recognizing User's Abbreviations of Proper Nouns without Increasing ASR Error Rates in Spoken Dialogue Systems
    Katsumaru, Masaki
    Komatani, Kazunori
    Ogata, Tetsuya
    Okuno, Hiroshi G.
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 187 - 190
  • [50] Real user evaluation of a POMDP spoken dialogue system using automatic belief compression
    Crook, Paul A.
    Keizer, Simon
    Wang, Zhuoran
    Tang, Wenshuo
    Lemon, Oliver
    [J]. COMPUTER SPEECH AND LANGUAGE, 2014, 28 (04) : 873 - 887