Robots learn to dance through interaction with humans

被引:0
作者
Qinggang Meng
Ibrahim Tholley
Paul W. H. Chung
机构
[1] Loughborough University,Department of Computer Science
来源
Neural Computing and Applications | 2014年 / 24卷
关键词
Robot dancing; Robot learning; Robot adaptation; Robot interaction with humans;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we investigated an approach for robots to learn to adapt dance actions to human’s preferences through interaction and feedback. Human’s preferences were extracted by analysing the common action patterns with positive or negative feedback from the human during robot dancing. By using a buffering technique to store the dance actions before a feedback, each individual’s preferences can be extracted even when a reward is received late. The extracted preferred dance actions from different people were then combined to generate improved dance sequences, i.e. performing more of what was preferred and less of that was not preferred. Together with Softmax action-selection method, the Sarsa reinforcement learning algorithm was used as the underlining learning algorithm and to effectively control the trade-off between exploitation of the learnt dance skills and exploration of new dance actions. The results showed that the robot learnt, using interactive reinforcement learning, the preferences of human partners, and the dance improved with the extracted preferences from more human partners.
引用
收藏
页码:117 / 124
页数:7
相关论文
共 6 条
  • [1] Aucouturier JJ(2008)Cheek to chip: dancing robots and AI’s future IEEE Intell Syst 23 74-84
  • [2] Shiratori T(2008)Synthesis of dance performance based on analyses of human motion and music IPSJ Trans Comput Vis Image Media 1 34-47
  • [3] Ikeuchi K(2006)Dancing-to-music character animation Comput Graph Forum Proc Eurograph 2006 449-458
  • [4] Shiratori T(undefined)undefined undefined undefined undefined-undefined
  • [5] Nakazawa A(undefined)undefined undefined undefined undefined-undefined
  • [6] Ikeuchi K(undefined)undefined undefined undefined undefined-undefined