Learning Adaptive Referring Expression Generation Policies for Spoken Dialogue Systems

被引:0
作者
Janarthanam, Srinivasan [1 ]
Lemon, Oliver [2 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
[2] Heriot Watt Univ, Sch Math & Comp Sci, Edinburgh EH14 4AS, Midlothian, Scotland
来源
EMPIRICAL METHODS IN NATURAL LANGUAGE GENERATION: DATA-ORIENTED METHODS AND EMPIRICAL EVALUATION | 2010年 / 5790卷
基金
英国工程与自然科学研究理事会;
关键词
Reinforcement Learning; Referring Expression Generation; Spoken Dialogue System; LANGUAGE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problem that different users have different lexical knowledge about problem domains, so that automated dialogue systems need to adapt their generation choices online to the users' domain knowledge as it encounters them. We approach this problem using Reinforcement Learning in Markov Decision Processes (MDP). We present a reinforcement learning framework to learn adaptive referring expression generation (REG) policies that can adapt dynamically to users with different domain knowledge levels. In contrast to related work we also propose a new statistical user model which incorporates the lexical knowledge of different users. We evaluate this framework by showing that it allows us to learn dialogue policies that automatically adapt their choice of referring expressions online to different users, and that these policies are significantly better than hand-coded adaptive policies for this problem. The learned policies are consistently between 2 and 8 turns shorter than a range of different hand-coded but adaptive baseline REG policies.
引用
收藏
页码:67 / +
页数:3
相关论文
共 41 条
[11]  
DALE R, 1995, COGNITIVE SCI, V19, P233, DOI 10.1207/s15516709cog1902_3
[12]  
DALE R, 1989, P ACL 1989
[13]  
GATT A, 2008, P INLG 2008
[14]  
GEORGILA K, 2005, P EUROSPEECH INTERSP
[15]  
HELLER D, 2009, P PRE COGSCI 2009
[16]   The curse of expertise: The effects of expertise and debiasing methods on predictions of novice performance [J].
Hinds, PJ .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-APPLIED, 1999, 5 (02) :205-221
[17]   REFERENCES IN CONVERSATION BETWEEN EXPERTS AND NOVICES [J].
ISAACS, EA ;
CLARK, HH .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 1987, 116 (01) :26-37
[18]  
JANARTHANAM S, 2009, P SIGDIAL 2009
[19]  
JANARTHANAM S, 2008, P SEMDIAL 2008
[20]  
JANARTHANAM S, 2009, P ENLG 2009