A distributed and morphology-independent strategy for adaptive locomotion in self-reconfigurable modular robots

Cited by: 50

Authors:

Christensen, David Johan [1]

Schultz, Ulrik Pagh [2]

Stoy, Kasper [2]

Affiliations:

[1] Tech Univ Denmark, Dept Elect Engn, DK-2800 Lyngby, Denmark

[2] Univ Southern Denmark, Maersk McKinney Moller Inst, Modular Robot Lab, Odense, Denmark

Source:

ROBOTICS AND AUTONOMOUS SYSTEMS | 2013 / Vol. 61 / No. 09

Keywords:

Self-reconfigurable modular robots; Locomotion; Online learning; Distributed control; Fault tolerance; CENTRAL PATTERN GENERATORS; MULTIMODE LOCOMOTION; DESIGN; CONTROLLERS; CHALLENGES; SYSTEMS; ONLINE; CONRO;

DOI:

10.1016/j.robot.2013.05.009

Chinese Library Classification:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

In this paper, we present a distributed reinforcement learning strategy for morphology-independent lifelong gait learning for modular robots. All modules run identical controllers that locally and independently optimize their action selection based on the robot's velocity as a global, shared reward signal. We evaluate the strategy experimentally, mainly on simulated but also on physical modular robots. We find that the strategy: (i) for six of seven configurations (3-12 modules), converges in 96% of the trials to the best known action-based gaits within 15 min, on average; (ii) can be transferred to physical robots with comparable performance; (iii) can be applied to learn simple gait control tables for both M-TRAN and ATRON robots; (iv) enables an 8-module robot to adapt to faults and changes in its morphology; and (v) can learn gaits for robots of up to 60 modules, although a divergence effect becomes substantial from 20-30 modules. These experiments demonstrate the advantages of a distributed learning strategy for modular robots, such as simplicity in implementation, low resource requirements, morphology independence, reconfigurability, and fault tolerance. © 2013 Elsevier B.V. All rights reserved.
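The control scheme summarized in the abstract (identical controllers, independent per-module action selection, one shared velocity reward) can be illustrated with a minimal sketch. This is not the authors' implementation: the epsilon-greedy rule, the running-average update, the parameter values, and every name below (`ModuleLearner`, `toy_velocity`, `run_learning`) are assumptions made for illustration, and the toy reward function merely stands in for a measured robot velocity.

```python
import random


class ModuleLearner:
    """Hypothetical per-module learner: one value estimate per action.

    Assumption: epsilon-greedy selection with a running-average update.
    The abstract does not specify the learning rule.
    """

    def __init__(self, n_actions, epsilon=0.2, alpha=0.1, rng=None):
        self.q = [0.0] * n_actions      # estimated reward per action
        self.epsilon = epsilon          # exploration probability
        self.alpha = alpha              # learning rate
        self.rng = rng or random.Random()
        self.last_action = 0

    def select_action(self):
        # Explore with probability epsilon, otherwise pick a best-valued action.
        if self.rng.random() < self.epsilon:
            self.last_action = self.rng.randrange(len(self.q))
        else:
            best = max(self.q)
            candidates = [a for a, v in enumerate(self.q) if v == best]
            self.last_action = self.rng.choice(candidates)
        return self.last_action

    def update(self, reward):
        # Move the estimate of the last action toward the shared reward.
        a = self.last_action
        self.q[a] += self.alpha * (reward - self.q[a])


def toy_velocity(actions, target):
    """Stand-in for measured velocity (assumption): fraction of modules
    whose action matches an arbitrary 'good gait' target."""
    return sum(a == t for a, t in zip(actions, target)) / len(target)


def run_learning(n_modules=4, n_actions=3, steps=2000, seed=0):
    rng = random.Random(seed)
    target = [rng.randrange(n_actions) for _ in range(n_modules)]
    # Every module runs the same controller code with its own local state.
    modules = [ModuleLearner(n_actions, rng=random.Random(seed + 1 + i))
               for i in range(n_modules)]
    for _ in range(steps):
        actions = [m.select_action() for m in modules]
        reward = toy_velocity(actions, target)   # global, shared signal
        for m in modules:
            m.update(reward)                      # local, independent update
    greedy = [max(range(n_actions), key=m.q.__getitem__) for m in modules]
    return greedy, target
```

Calling `run_learning()` returns each module's currently preferred action alongside the toy target. No module sees another module's state or action; the only coupling is the broadcast reward, which is why a sketch like this needs no knowledge of the robot's morphology.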


Pages: 1021-1035

Page count: 15
