Incremental Imitation Learning of Context-Dependent Motor Skills

Cited by: 0

Authors:

Ewerton, Marco [1]

Maeda, Guilherme [1]

Kollegger, Gerrit [2]

Wiemeyer, Josef [2]

Peters, Jan [1,3]

Affiliations:

[1] Tech Univ Darmstadt, Dept Comp Sci, Intelligent Autonomous Syst Grp, Hochschulstr 10, D-64289 Darmstadt, Germany

[2] Tech Univ Darmstadt, Inst Sport Sci, Magdalenenstr 27, D-64289 Darmstadt, Germany

[3] Max Planck Inst Intelligent Syst, Spemannstr 38, D-72076 Tubingen, Germany

Source:

2016 IEEE-RAS 16TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS) | 2016

Keywords:

MOTION PRIMITIVES;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Teaching motor skills to robots through human demonstrations, an approach called "imitation learning", is an alternative to hand coding each new robot behavior. Imitation learning is relatively cheap in terms of time and labor and is a promising route to giving robots the functionality they need for widespread use in households, stores, hospitals, etc. However, current imitation learning techniques struggle with a number of challenges that limit their usability. For instance, robots might not be able to accurately reproduce every human demonstration, and it is not always clear how robots should generalize a movement to new contexts. This paper addresses those challenges by presenting a method to incrementally teach context-dependent motor skills to robots. The human demonstrates trajectories for different contexts by moving the links of the robot, and partially or fully refines those trajectories by disturbing the movements of the robot while it executes the behavior it has learned so far. A joint probability distribution over trajectories and contexts can then be built from those demonstrations and refinements. Given a new context, the robot computes the most probable trajectory, which can also be refined by the human. The joint probability distribution is incrementally updated with the refined trajectories. We have evaluated our method in experiments in which an elastically actuated robot arm with four degrees of freedom learns how to reach a ball at different positions.
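
The idea of a joint distribution over trajectories and contexts that is conditioned on a new context and updated with refinements can be illustrated with a minimal sketch. It assumes a single joint Gaussian over a flattened trajectory vector and a context vector; the abstract does not state the distribution family or the trajectory representation, so this is an assumption, and `JointGaussianSkill` and its methods are hypothetical names, not the authors' implementation.

```python
import numpy as np


class JointGaussianSkill:
    """Hypothetical sketch: joint Gaussian over (trajectory, context) pairs."""

    def __init__(self, reg=1e-6):
        self.samples = []   # each row: [trajectory values..., context values...]
        self.dim_traj = None
        self.reg = reg      # small diagonal term to keep the covariance invertible

    def add(self, trajectory, context):
        """Store one demonstration or refinement (the incremental update)."""
        trajectory = np.ravel(np.asarray(trajectory, dtype=float))
        context = np.ravel(np.asarray(context, dtype=float))
        self.dim_traj = trajectory.size
        self.samples.append(np.concatenate([trajectory, context]))

    def _fit(self):
        """Estimate mean and covariance of the joint distribution."""
        X = np.stack(self.samples)
        mean = X.mean(axis=0)
        Xc = X - mean
        cov = Xc.T @ Xc / len(X) + self.reg * np.eye(X.shape[1])
        return mean, cov

    def most_probable_trajectory(self, context):
        """Mean of p(trajectory | context), which is also its mode for a Gaussian."""
        mean, cov = self._fit()
        d = self.dim_traj
        mu_t, mu_c = mean[:d], mean[d:]
        S_tc = cov[:d, d:]
        S_cc = cov[d:, d:]
        delta = np.ravel(np.asarray(context, dtype=float)) - mu_c
        return mu_t + S_tc @ np.linalg.solve(S_cc, delta)


# Toy data (made up): a 3-point "trajectory" that depends linearly on a 1-D context.
skill = JointGaussianSkill()
for c in (0.0, 1.0, 2.0):
    skill.add([c, 2 * c + 1, 3 * c], [c])

before = skill.most_probable_trajectory([1.5])

# A human refinement for the new context is added, and the model is re-queried.
skill.add([2.0, 4.0, 4.5], [1.5])
after = skill.most_probable_trajectory([1.5])
```

Here the refit on every query is a simplification; a running update of the mean and covariance would give the same estimate without storing all samples.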


Pages: 351-358

Page count: 8
