Combining language models in the input interface of a spoken dialogue system

被引:17
作者
Lopez-Cozar, R. [1 ]
Callejas, Z. [1 ]
机构
[1] Univ Granada, Fac Comp Sci, Dept Languages & Comp Syst, E-18071 Granada, Spain
关键词
D O I
10.1016/j.csl.2005.05.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new technique to enhance the performance of the input interface of spoken dialogue systems based on a procedure that combines during speech recognition the advantages of using prompt-dependent language models with those of using a language model independent of the prompts generated by the dialogue system. The technique proposes to create a new speech recognizer, termed contextual speech recognizer, that uses a prompt-independent language model to allow recognizing any kind of sentence permitted in the application domain, and at the same time, uses contextual information (in the form of prompt-dependent language models) to take into account that some sentences are more likely to be uttered than others at a particular moment of the dialogue. The experiments show the technique allows enhancing clearly the performance of the input interface of a previously developed dialogue system based exclusively on prompt-dependent language models. But most important, in comparison with a standard speech recognizer that uses just one prompt-independent language model without contextual information, the proposed recognizer allows increasing the word accuracy and sentence understanding rates by 4.09% and 4.19% absolute, respectively. These scores are slightly better than those obtained using linear interpolation of the prompt-independent and prompt-dependent language models used in the experiments. (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:420 / 440
页数:21
相关论文
共 40 条
[1]  
[Anonymous], P EUR
[2]  
[Anonymous], P EUR
[3]  
[Anonymous], P EUR
[4]  
[Anonymous], P INT C SPOK LANG PR
[5]  
BACA J, 2003, P EUR C SPEECH COMM, P1929
[6]  
BERNSEN NO, 2003, P EUR C SPEECH COMM, P737
[7]  
BONNEAUMAYNARD H, 2003, P EUR, P253
[8]   WIZARD OF OZ STUDIES - WHY AND HOW [J].
DAHLBACK, N ;
JONSSON, A ;
AHRENBERG, L .
KNOWLEDGE-BASED SYSTEMS, 1993, 6 (04) :258-266
[9]   An interactive dialog system for learning Japanese [J].
Ehsani, F ;
Bernstein, J ;
Najmi, A .
SPEECH COMMUNICATION, 2000, 30 (2-3) :167-177
[10]  
EMAMI A, 2003, P EUR, P413