POLICY COMMITTEE FOR ADAPTATION IN MULTI-DOMAIN SPOKEN DIALOGUE SYSTEMS

被引：0

作者：

Gasic, M. ^{[1
]}

Mrksic, N. ^{[1
]}

Su, P-H. ^{[1
]}

Vandyke, D. ^{[1
]}

Wen, T-H. ^{[1
]}

Young, S. ^{[1
]}

机构：

[1] Univ Cambridge, Dept Engn, Trumpington St, Cambridge CB2 1PZ, England

来源：

2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU) | 2015年

基金：

英国工程与自然科学研究理事会;

关键词：

Bayesian committee machines; Gaussian processes; reinforcement learning;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Moving from limited-domain dialogue systems to open domain dialogue systems raises a number of challenges. One of them is the ability of the system to utilise small amounts of data from disparate domains to build a dialogue manager policy. Previous work has focused on using data from different domains to adapt a generic policy to a specific domain. Inspired by Bayesian committee machines, this paper proposes the use of a committee of dialogue policies. The results show that such a model is particularly beneficial for adaptation in multi-domain dialogue systems. The use of this model significantly improves performance compared to a single policy baseline, as confirmed by the performed real-user trial. This is the first time a dialogue policy has been trained on multiple domains on-line in interaction with real users.

引用

页码：806 / 812

页数：7

共 21 条

[1]

[Anonymous], P SIGDIAL

[2]

[Anonymous], TASLP

[3]

[Anonymous], P ICML

[4]

[Anonymous], P HLT

[5]

[Anonymous], P WORKSH ACT LEARN E

[6] Evaluation of a hierarchical reinforcement learning spoken dialogue system [J].

Cuayahuitl, Heriberto ;

Renals, Steve ;

Lemon, Oliver ;

Shimodaira, Hiroshi .

COMPUTER SPEECH AND LANGUAGE, 2010, 24 (02) :395-429

[7]

Gasic M., 2015, P ICASSP

[8]

GASIC M, 2009, AUT SPEECH REC UND 2, P456

[9]

Heck L., 2013, Proc. of Conf. of the Int. Speech Commun. Assoc, P1594

[10]

Jebara T, 2004, J MACH LEARN RES, V5, P819

← 1 2 3 →