Dynamic motion learning for multi-DOF flexible-joint robots using active-passive motor babbling through deep learning

Cited by: 13

Authors:

Takahashi, Kuniyuki ^{[1,2]}

Ogata, Tetsuya ^{[3]}

Nakanishi, Jun ^{[4]}

Cheng, Gordon ^{[5]}

Sugano, Shigeki ^{[1]}

Affiliations:

[1] Waseda Univ, Grad Sch Creat Sci & Engn, Tokyo, Japan

[2] Japan Soc Promot Sci, Tokyo, Japan

[3] Waseda Univ, Grad Sch Fundamental Sci & Engn, Tokyo, Japan

[4] Nagoya Univ, Dept Micronano Mech Sci & Engn, Nagoya, Aichi, Japan

[5] Tech Univ Munich, Inst Cognit Syst, Munich, Germany

Source:

ADVANCED ROBOTICS | 2017 / Vol. 31 / No. 18

Keywords:

Motor babbling; flexible-joint robot; dynamic motion learning; recurrent neural network; deep learning; SELF;

DOI:

10.1080/01691864.2017.1383939

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Discipline classification codes:

080202 ; 1405 ;

Abstract:

This paper proposes a learning strategy for robots with flexible joints having multi-degrees of freedom in order to achieve dynamic motion tasks. In spite of there being several potential benefits of flexible-joint robots such as exploitation of intrinsic dynamics and passive adaptation to environmental changes with mechanical compliance, controlling such robots is challenging because of increased complexity of their dynamics. To achieve dynamic movements, we introduce a two-phase learning framework of the body dynamics of the robot using a recurrent neural network motivated by a deep learning strategy. The proposed methodology comprises a pre-training phase with motor babbling and a fine-tuning phase with additional learning of the target tasks. In the pre-training phase, we consider active and passive exploratory motions for efficient acquisition of body dynamics. In the fine-tuning phase, the learned body dynamics are adjusted for specific tasks. We demonstrate the effectiveness of the proposed methodology in achieving dynamic tasks involving constrained movement requiring interactions with the environment on a simulated robot model and an actual PR2 robot both of which have a compliantly actuated seven degree-of-freedom arm. The results illustrate a reduction in the required number of training iterations for task learning and generalization capabilities for untrained situations.

Pages: 1002-1015

Page count: 14
