Testing the performance of spoken dialogue systems by means of an artificially simulated user

被引:18
作者
Lopez-Cozar, Ramon [1 ]
Callejas, Zoraida [1 ]
McTear, Michael [2 ]
机构
[1] Univ Granada, Dept Languages & Comp Syst, Fac Comp Sci, E-18071 Granada, Spain
[2] Univ Ulster, Sch Comp & Math, Newtownabbey, North Ireland
关键词
spoken dialogue systems; speech recognition; speech understanding; user simulation; artificial intelligence; natural language processing; robust human-computer interaction;
D O I
10.1007/s10462-007-9059-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new technique to test the performance of spoken dialogue systems by artificially simulating the behaviour of three types of user (very cooperative, cooperative and not very cooperative) interacting with a system by means of spoken dialogues. Experiments using the technique were carried out to test the performance of a previously developed dialogue system designed for the fast-food domain and working with two kinds of language model for automatic speech recognition: one based on 17 prompt-dependent language models, and the other based on one prompt-independent language model. The use of the simulated user enables the identification of problems relating to the speech recognition, spoken language understanding, and dialogue management components of the system. In particular, in these experiments problems were encountered with the recognition and understanding of postal codes and addresses and with the lengthy sequences of repetitive confirmation turns required to correct these errors. By employing a simulated user in a range of different experimental conditions sufficient data can be generated to support a systematic analysis of potential problems and to enable fine-grained tuning of the system.
引用
收藏
页码:291 / 323
页数:33
相关论文
共 50 条
  • [21] Enhancement of the Input Interface of Spoken Dialogue Systems By Means of Contextual Models and Grammatical Rules
    Lopez-Cozar, Ramon
    Callejas, Zoraida
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2009, (43): : 113 - 120
  • [22] Spoken dialogue technology: Enabling the conversational user interface
    McTear, MF
    ACM COMPUTING SURVEYS, 2002, 34 (01) : 90 - 169
  • [23] Providing personalized Internet services by means of context-aware spoken dialogue systems
    Griol, David
    Manuel Molina, Jose
    Callejas, Zoraida
    JOURNAL OF AMBIENT INTELLIGENCE AND SMART ENVIRONMENTS, 2013, 5 (01) : 23 - 45
  • [24] New Technique to Enhance the Performance of Spoken Dialogue Systems Based on Dialogue States-Dependent Language Models and Grammatical Rules
    Lopez-Cozar, Ramon
    Griol, David
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2998 - +
  • [25] Machine Learning for Spoken Dialogue Systems
    Lemon, Oliver
    Pietquin, Olivier
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1761 - +
  • [26] Modeling user behavior online for disambiguating user input in a spoken dialogue system
    Wang, Fangju
    Swegles, Kyle
    SPEECH COMMUNICATION, 2013, 55 (01) : 84 - 98
  • [27] A model for incremental grounding in spoken dialogue systems
    Thomas Visser
    David Traum
    David DeVault
    Rieks op den Akker
    Journal on Multimodal User Interfaces, 2014, 8 : 61 - 73
  • [28] A model for incremental grounding in spoken dialogue systems
    Visser, Thomas
    Traum, David
    DeVault, David
    op den Akker, Rieks
    JOURNAL ON MULTIMODAL USER INTERFACES, 2014, 8 (01) : 61 - 73
  • [29] Loss less Value Directed Compression of Complex User Goal States for Statistical Spoken Dialogue Systems
    Crook, Paul A.
    Lemon, Oliver
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1036 - 1039
  • [30] An Analysis of Older Users' Interactions with Spoken Dialogue Systems
    Bost, Jamie
    Moore, Johanna D.
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1176 - 1181