A statistical approach to spoken dialog systems design and evaluation

被引:83
作者
Griol, David [1 ]
Hurtado, Lluis F. [1 ]
Segarra, Encarna [1 ]
Sanchis, Emilio [1 ]
机构
[1] Univ Politecn Valencia, Dept Sistemes Informat & Comp, E-46022 Valencia, Spain
关键词
spoken dialog systems; statistical models; dialog management; user simulation; system evaluation;
D O I
10.1016/j.specom.2008.04.001
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we present a statistical approach for the development of a dialog manager and for learning optimal dialog strategies. This methodology is based oil a classification procedure that considers all of the previous history of the dialog to select the next system answer. To evaluate the performance of the dialog system, the statistical approach for dialog management has been extended to model the user behavior. The statistical user simulator has been used for the evaluation and improvement of the dialog strategy. Both the user model and the system model arc automatically learned from a training corpus that is labeled in terms of dialog acts. New measures have been defined to evaluate the performance of the dialog system. Using these measures, we evaluate both the quality of the simulated dialogs and the improvement of the new dialog strategy that is obtained with the interaction of the two modules. This methodology has been applied to develop a dialog manager within the framework of the DIHANA project, whose goal is the design and development of a dialog system to access a railway information system using spontaneous speech in Spanish. We propose the use of corpus-based methodologies to develop the main modules in the dialog system. (c) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:666 / 682
页数:17
相关论文
共 58 条
[1]  
[Anonymous], NATURAL LANGUAGE ENG
[2]  
[Anonymous], 2003, 4th SIGdial Workshop on Discourse and Dialogue
[3]  
[Anonymous], P EUR
[4]  
[Anonymous], 2001, P NAACL WORKSH AD DI
[5]  
Benedi Jose-Miguel, 2006, 5 INT C LANG RES EV, P1636
[6]  
Bishop CM., 1995, Neural networks for pattern recognition
[7]  
BONAFONTE A, 2000, P PRIM JORN TECN HAB
[8]  
CASTRO MJ, 2003, LNCS, V2527, P664
[9]  
Cuayáhuitl H, 2005, 2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), P290
[10]  
CUAYAHUITL H, 2006, P INTERSP 06 ICSLP P, P469