A distributed and morphology-independent strategy for adaptive locomotion in self-reconfigurable modular robots

被引:50
作者
Christensen, David Johan [1 ]
Schultz, Ulrik Pagh [2 ]
Stoy, Kasper [2 ]
机构
[1] Tech Univ Denmark, Dept Elect Engn, DK-2800 Lyngby, Denmark
[2] Univ Southern Denmark, Maersk McKinney Moller Inst, Modular Robot Lab, Odense, Denmark
关键词
Self-reconfigurable modular robots; Locomotion; Online learning; Distributed control; Fault tolerance; CENTRAL PATTERN GENERATORS; MULTIMODE LOCOMOTION; DESIGN; CONTROLLERS; CHALLENGES; SYSTEMS; ONLINE; CONRO;
D O I
10.1016/j.robot.2013.05.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a distributed reinforcement learning strategy for morphology-independent lifelong gait learning for modular robots. All modules run identical controllers that locally and independently optimize their action selection based on the robot's velocity as a global, shared reward signal. We evaluate the strategy experimentally mainly on simulated, but also on physical, modular robots. We find that the strategy: (i) for six of seven configurations (3-12 modules) converge in 96% of the trials to the best known action-based gaits within 15 min, on average, (ii) can be transferred to physical robots with a comparable performance, (iii) can be applied to learn simple gait control tables for both M-TRAN and ATRON robots, (iv) enables an 8-module robot to adapt to faults and changes in its morphology, and (v) can learn gaits for up to 60 module robots but a divergence effect becomes substantial from 20-30 modules. These experiments demonstrate the advantages of a distributed learning strategy for modular robots, such as simplicity in implementation, low resource requirements, morphology independence, reconfigurability, and fault tolerance. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:1021 / 1035
页数:15
相关论文
共 83 条
[1]  
[Anonymous], 1994, THESIS STANFORD U
[2]  
Bongard JC, 2003, MORPHO-FUNCTIONAL MACHINES: THE NEW SPECIES, P237
[3]   Resilient machines through continuous self-modeling [J].
Bongard, Josh ;
Zykov, Victor ;
Lipson, Hod .
SCIENCE, 2006, 314 (5802) :1118-1121
[4]  
Brandt David, 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, P2375, DOI 10.1109/IROS.2007.4399191
[5]  
BROOKS RA, 1992, FROM ANIM ANIMAT, P3
[6]   Offline GA-Based Optimization for Heterogeneous Modular Multiconfigurable Chained Microrobots [J].
Brunete, Alberto ;
Hernando, Miguel ;
Gambao, Ernesto .
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2013, 18 (02) :578-585
[7]  
Butler Z, 2003, 2003 IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN ROBOTICS AND AUTOMATION, VOLS I-III, PROCEEDINGS, P880
[8]   CONRO: Towards deployable robots with inter-robots metamorphic capabilities [J].
Castano, A ;
Shen, WM ;
Will, P .
AUTONOMOUS ROBOTS, 2000, 8 (03) :309-324
[9]  
CHIRIKJIAN GS, 1994, IEEE INT CONF ROBOT, P449, DOI 10.1109/ROBOT.1994.351256
[10]  
Christensen David Johan, 2010, 2010 IEEE International Conference on Robotics and Automation (ICRA 2010), P2765, DOI 10.1109/ROBOT.2010.5509942