Machine Learning for Social Multiparty Human-Robot Interaction

被引:31
作者
Keizer, Simon [1 ]
Foster, Mary Ellen [1 ]
Wang, Zhuoran [1 ]
Lemon, Oliver [1 ]
机构
[1] Heriot Watt Univ, Interact Lab, Sch Math & Comp Sci, Edinburgh EH14 4AS, Midlothian, Scotland
基金
欧盟第七框架计划;
关键词
Algorithms; Design; Performance; Social robotics; machine learning; multiuser interaction;
D O I
10.1145/2600021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a variety of machine-learning techniques that are being applied to social multiuser humanrobot interaction using a robot bartender in our scenario. We first present a data-driven approach to social state recognition based on supervised learning. We then describe an approach to social skills executionthat is, action selection for generating socially appropriate robot behavior-which is based on reinforcement learning, using a data-driven simulation of multiple users to train execution policies for social skills. Next, we describe how these components for social state recognition and skills execution have been integrated into an end-to-end robot bartender system, and we discuss the results of a user evaluation. Finally, we present an alternative unsupervised learning framework that combines social state recognition and social skills execution based on hierarchical Dirichlet processes and an infinite POMDP interaction manager. The models make use of data from both human-human interactions collected in a number of German bars and human-robot interactions recorded in the evaluation of an initial version of the system.
引用
收藏
页数:32
相关论文
共 59 条
[51]   Perseus: Randomized point-based value iteration for POMDPs [J].
Spaan, MTJ ;
Vlassis, N .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2005, 24 :195-220
[52]  
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[53]   Hierarchical Dirichlet processes [J].
Teh, Yee Whye ;
Jordan, Michael I. ;
Beal, Matthew J. ;
Blei, David M. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (476) :1566-1581
[54]   Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems [J].
Thomson, Blaise ;
Young, Steve .
COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04) :562-588
[55]  
Wang Z, 2012, P SLT, DOI [10.1109/SLT.2012.6424162, DOI 10.1109/SLT.2012.6424162]
[56]  
White Michael, 2006, RES LANGUAGE COMPUTA, V4.1, P39, DOI DOI 10.1007/S11168-006-9010-2
[57]   Partially observable Markov decision processes for spoken dialog systems [J].
Williams, Jason D. ;
Young, Steve .
COMPUTER SPEECH AND LANGUAGE, 2007, 21 (02) :393-422
[58]  
Wittenburg P., 2006, PROC 5 INT C LANGUAG, P1556
[59]   The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management [J].
Young, Steve ;
Gasic, Milica ;
Keizer, Simon ;
Mairesse, Francois ;
Schatzmann, Jost ;
Thomson, Blaise ;
Yu, Kai .
COMPUTER SPEECH AND LANGUAGE, 2010, 24 (02) :150-174