Learning to move in modular robots using central pattern generators and online optimization

Cited by: 99
Authors
Sproewitz, Alexander [1]
Moeckel, Rico [1]
Maye, Jerome [1]
Ijspeert, Auke Jan [1]
Affiliation
[1] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, CH-1015 Lausanne, Switzerland
Keywords
central pattern generators; modular robotics; online learning; locomotion; Powell's method; Bluetooth
DOI
10.1177/0278364907088401
Chinese Library Classification (CLC) number
TP24 [Robot technology]
Subject classification codes
080202; 1405
Abstract
This article addresses the problem of how modular robotics systems, i.e. systems composed of multiple modules that can be configured into different robotic structures, can learn to locomote. In particular, we tackle the problems of online learning, that is, learning while moving, and the problem of dealing with unknown arbitrary robotic structures. We propose a framework for learning locomotion controllers based on two components: a central pattern generator (CPG) and a gradient-free optimization algorithm referred to as Powell's method. The CPG is implemented as a system of coupled nonlinear oscillators in our YaMoR modular robotic system, with one oscillator per module. The nonlinear oscillators are coupled together across modules using Bluetooth communication to obtain specific gaits, i.e. synchronized patterns of oscillations among modules. Online learning involves running the Powell optimization algorithm in parallel with the CPG model, with the speed of locomotion being the criterion to be optimized. Interesting aspects of the optimization include the fact that it is carried out online, the robots do not require stopping or resetting and it is fast. We present results showing the interesting properties of this framework for a modular robotic system. In particular, our CPG model can readily be implemented in a distributed system, it is computationally cheap, it exhibits limit cycle behavior (temporary perturbations are rapidly forgotten), it produces smooth trajectories even when control parameters are abruptly changed and it is robust against imperfect communication among modules. We also present results of learning to move with three different robot structures. Interesting locomotion modes are obtained after running the optimization for less than 60 minutes.
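The limit cycle behavior described in the abstract can be illustrated with a small sketch. It does not reproduce the article's oscillator equations, which this record does not give. It assumes a generic chain of sine-coupled phase oscillators, one per module, and every name and parameter value in it (simulate_cpg, phase_bias, coupling, kick_at) is hypothetical. The sketch shows neighboring oscillators settling to a fixed phase difference, and returning to it after a temporary perturbation.

```python
import math


def simulate_cpg(n_modules=3, phase_bias=math.pi / 2, coupling=2.0,
                 freq_hz=1.0, dt=0.001, steps=10000, kick_at=None):
    """Euler-integrate a chain of phase oscillators (illustrative model only).

    Each oscillator is pulled toward a phase lag of `phase_bias` relative
    to its neighbors in the chain. Returns the final phase differences
    between consecutive oscillators.
    """
    omega = 2.0 * math.pi * freq_hz
    # Arbitrary, unsynchronized starting phases.
    phases = [0.3 * i * i for i in range(n_modules)]

    for step in range(steps):
        if kick_at is not None and step == kick_at:
            phases[0] += 1.0  # temporary perturbation of one module

        rates = []
        for i in range(n_modules):
            rate = omega
            if i > 0:
                # Desired: phases[i] - phases[i - 1] == phase_bias
                rate += coupling * math.sin(phases[i - 1] - phases[i] + phase_bias)
            if i < n_modules - 1:
                # Desired: phases[i + 1] - phases[i] == phase_bias
                rate += coupling * math.sin(phases[i + 1] - phases[i] - phase_bias)
            rates.append(rate)

        phases = [p + r * dt for p, r in zip(phases, rates)]

    return [phases[i + 1] - phases[i] for i in range(n_modules - 1)]


if __name__ == "__main__":
    print("no perturbation:  ", simulate_cpg())
    print("with perturbation:", simulate_cpg(kick_at=3000))
```

In both runs the phase differences should end close to the requested bias, which is the sense in which a perturbation is "forgotten". An optimizer such as Powell's method would then tune parameters like the phase biases against measured locomotion speed; that step is not sketched here.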
Pages: 423-443
Page count: 21