Scaling POMDPs for spoken dialog management

被引:52
作者
Williams, Jason D. [1 ]
Young, Steve
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
[2] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2007年 / 15卷 / 07期
关键词
decision theory; dialog management; partially observable Markov decision process (POMDP); planning under uncertainty; spoken dialog system (SDS);
D O I
10.1109/TASL.2007.902050
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Control in spoken dialog systems is challenging largely because automatic speech recognition is unreliable, and hence the state of the conversation can never be known with certainty. Partially observable Markov decision processes (POMDPs) provide a principled mathematical framework for planning and control in this context; however, POMDPs face severe scalability challenges, and past work has been limited to trivially small dialog tasks. This paper presents a novel POMDP optimization technique-composite summary point-based value iteration (CSPBVI)-which enables optimization to be performed on slot-filling POMDP-based dialog managers of a realistic size. Using dialog models trained on data from a tourist information domain, simulation results show that CSPBVI scales effectively, outperforms non-POMDP baselines, and is robust to estimation errors.
引用
收藏
页码:2116 / 2129
页数:14
相关论文
共 35 条
[1]  
[Anonymous], 1971, THESIS I OPERATIONS
[3]  
Boutilier C, 1996, PROCEEDINGS OF THE THIRTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE, VOLS 1 AND 2, P1168
[4]  
Christopher John Cornish Hellaby Watkins, 1989, LEARNING DELAYED REW
[5]  
Drake A. W., 1962, Observation of a Markov process through a noisy channel
[6]  
GUESTRIN C, 2001, P WORKSH PLANN UNC I, P67
[7]  
Hansen E. A., 2000, Proceedings of the Fifth International Conference on Artificial Intelligence Planning and Scheduling, P130
[8]  
HORVITZ E, 2000, P INT C SPOK LANG PR, P226
[9]   Planning and acting in partially observable stochastic domains [J].
Kaelbling, LP ;
Littman, ML ;
Cassandra, AR .
ARTIFICIAL INTELLIGENCE, 1998, 101 (1-2) :99-134
[10]  
Larsson S., 2000, Natural Language Engineering, V6, P323, DOI 10.1017/S1351324900002539