Machine Learning for Social Multiparty Human-Robot Interaction

被引:32
作者
Keizer, Simon [1 ]
Foster, Mary Ellen [1 ]
Wang, Zhuoran [1 ]
Lemon, Oliver [1 ]
机构
[1] Heriot Watt Univ, Interact Lab, Sch Math & Comp Sci, Edinburgh EH14 4AS, Midlothian, Scotland
基金
欧盟第七框架计划;
关键词
Algorithms; Design; Performance; Social robotics; machine learning; multiuser interaction;
D O I
10.1145/2600021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a variety of machine-learning techniques that are being applied to social multiuser humanrobot interaction using a robot bartender in our scenario. We first present a data-driven approach to social state recognition based on supervised learning. We then describe an approach to social skills executionthat is, action selection for generating socially appropriate robot behavior-which is based on reinforcement learning, using a data-driven simulation of multiple users to train execution policies for social skills. Next, we describe how these components for social state recognition and skills execution have been integrated into an end-to-end robot bartender system, and we discuss the results of a user evaluation. Finally, we present an alternative unsupervised learning framework that combines social state recognition and social skills execution based on hierarchical Dirichlet processes and an infinite POMDP interaction manager. The models make use of data from both human-human interactions collected in a number of German bars and human-robot interactions recorded in the evaluation of an initial version of the system.
引用
收藏
页数:32
相关论文
共 59 条
  • [51] Perseus: Randomized point-based value iteration for POMDPs
    Spaan, MTJ
    Vlassis, N
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2005, 24 : 195 - 220
  • [52] Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
  • [53] Hierarchical Dirichlet processes
    Teh, Yee Whye
    Jordan, Michael I.
    Beal, Matthew J.
    Blei, David M.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (476) : 1566 - 1581
  • [54] Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems
    Thomson, Blaise
    Young, Steve
    [J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04) : 562 - 588
  • [55] Wang Z, 2012, P SLT, DOI [10.1109/SLT.2012.6424162, DOI 10.1109/SLT.2012.6424162]
  • [56] White Michael, 2006, RES LANGUAGE COMPUTA, V4.1, P39, DOI DOI 10.1007/S11168-006-9010-2
  • [57] Partially observable Markov decision processes for spoken dialog systems
    Williams, Jason D.
    Young, Steve
    [J]. COMPUTER SPEECH AND LANGUAGE, 2007, 21 (02) : 393 - 422
  • [58] Wittenburg P., 2006, PROC 5 INT C LANGUAG, P1556
  • [59] The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management
    Young, Steve
    Gasic, Milica
    Keizer, Simon
    Mairesse, Francois
    Schatzmann, Jost
    Thomson, Blaise
    Yu, Kai
    [J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (02) : 150 - 174