Robots learn to dance through interaction with humans

被引：20

作者：

Meng, Qinggang ^{[1
]}

Tholley, Ibrahim ^{[1
]}

Chung, Paul W. H. ^{[1
]}

机构：

[1] Univ Loughborough, Dept Comp Sci, Loughborough, Leics, England

来源：

NEURAL COMPUTING & APPLICATIONS | 2014年 / 24卷 / 01期

关键词：

Robot dancing; Robot learning; Robot adaptation; Robot interaction with humans;

D O I：

10.1007/s00521-013-1504-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we investigated an approach for robots to learn to adapt dance actions to human's preferences through interaction and feedback. Human's preferences were extracted by analysing the common action patterns with positive or negative feedback from the human during robot dancing. By using a buffering technique to store the dance actions before a feedback, each individual's preferences can be extracted even when a reward is received late. The extracted preferred dance actions from different people were then combined to generate improved dance sequences, i.e. performing more of what was preferred and less of that was not preferred. Together with Softmax action-selection method, the Sarsa reinforcement learning algorithm was used as the underlining learning algorithm and to effectively control the trade-off between exploitation of the learnt dance skills and exploration of new dance actions. The results showed that the robot learnt, using interactive reinforcement learning, the preferences of human partners, and the dance improved with the extracted preferences from more human partners.

引用

页码：117 / 124

页数：8

共 23 条

[1] [Anonymous], IEEE T COMPUT VIS IM
[2] Aucouturier JJ, 2008, IEEE INTELL SYST, V23, P74, DOI 10.1109/MIS.2008.22
[3] Austermann A, 2008, 2008 17TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, VOLS 1 AND 2, P41, DOI 10.1109/ROMAN.2008.4600641
[4] Cyberbotics, 2011, WEB 6 FAST PROT SIM
[5] Dozier G, 2001, 16 ACM S APPL COMP S, P340
[6] Fang Liu, 2004, Fifth World Congress on Intelligent Control and Automation (IEEE Cat. No.04EX788), P4865
[7] Holldampf Jens, 2010, 2010 RO-MAN: The 19th IEEE International Symposium on Robot and Human Interactive Communication, P527, DOI 10.1109/ROMAN.2010.5598616
[8] Kober J, 2013, INT J ROB R IN PRESS
[9] Belief revision with reinforcement learning for interactive object recognition
Leopold, Thomas
Kern-Isberner, Gabriele
Peters, Gabriele
[J]. ECAI 2008, PROCEEDINGS, 2008, 178 : 65 - +
[10] Peralta Raquel Torres, 2011, 2011 RO-MAN: The 20th IEEE International Symposium on Robot and Human Interactive Communication, P113, DOI 10.1109/ROMAN.2011.6005273

← 1 2 3 →