A Humanoid Robot Learns to Recover Perturbation During Swinging Motion

Cited by: 8

Authors:

Tran, Duy Hoa [1]

Hamker, Fred [1]

Nassour, John [1]

Affiliations:

[1] Tech Univ Chemnitz, Fac Comp Sci, Artificial Intelligence Lab, D-09107 Chemnitz, Germany

Source:

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2020 / Vol. 50 / Issue 10

Keywords:

Neurons; Perturbation methods; Humanoid robots; Legged locomotion; Generators; Switches; Central pattern generator (CPG); fall detection; push recovery; reinforcement learning; self-organizing map (SOM); WALKING PATTERN GENERATION; BIPED WALKING; PUSH RECOVERY; FEATURE-SELECTION; PREVIEW CONTROL; LOCOMOTION; CONTROLLER; MODEL; GAIT; DRIVEN;

DOI:

10.1109/TSMC.2018.2884619

Chinese Library Classification (CLC):

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

We present an approach to fall detection and perturbation recovery during humanoid robot swinging. Reinforcement learning (Q-learning) is employed to explore the relationship between actions and states that allows the robot to trigger a reaction to avoid falling. A self-organizing map (SOM) with a circular topological neighborhood function is employed to transform continuous exteroceptive information of the robot during stable swinging into a discrete representation of states. We take advantage of the SOM's clustering and topology preservation for perturbation detection. Swinging and recovery actions are generated from the same neural model using a multilayered multipattern central pattern generator. Experiments carried out in simulation and on a real humanoid robot (NAO) show that our approach allows humanoid robots to recover successfully from pushes by learning to switch from a rhythmic to an appropriate nonrhythmic behavior.
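The SOM step described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the synthetic two-dimensional "swing" signal, the function names, the hyperparameters, and the use of a quantization-error threshold as the perturbation test are all assumptions made for illustration. The only elements taken from the abstract are the idea of a SOM whose neighborhood is circular (units arranged on a ring, matching a rhythmic motion) and the use of the map to discretize sensor readings recorded during stable swinging.

```python
import math
import random


def ring_distance(i, j, n):
    """Distance between units i and j when n units are arranged on a ring."""
    d = abs(i - j)
    return min(d, n - d)


def best_matching_unit(weights, x):
    """Return (index, Euclidean distance) of the unit closest to sample x."""
    dists = [math.dist(w, x) for w in weights]
    idx = min(range(len(weights)), key=dists.__getitem__)
    return idx, dists[idx]


def train_ring_som(samples, n_units=12, epochs=40, lr0=0.5, sigma0=3.0, seed=0):
    """Train a 1-D SOM with a circular (wrap-around) Gaussian neighborhood.

    All hyperparameters are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    weights = [list(rng.choice(samples)) for _ in range(n_units)]
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                  # linearly decaying learning rate
        sigma = max(sigma0 * (1.0 - frac), 0.5)  # shrinking neighborhood width
        for x in rng.sample(samples, len(samples)):
            bmu, _ = best_matching_unit(weights, x)
            for j, w in enumerate(weights):
                d = ring_distance(bmu, j, n_units)
                h = math.exp(-(d ** 2) / (2 * sigma ** 2))
                for k in range(len(w)):
                    w[k] += lr * h * (x[k] - w[k])
    return weights


def is_perturbed(weights, x, threshold):
    """Assumed test: flag a sample that lies far from every SOM unit."""
    _, err = best_matching_unit(weights, x)
    return err > threshold


def synthetic_swing(n=200, period=50, noise=0.02, seed=1):
    """Hypothetical stand-in for sensor data recorded during stable swinging."""
    rng = random.Random(seed)
    samples = []
    for t in range(n):
        phase = 2 * math.pi * t / period
        samples.append((math.cos(phase) + rng.gauss(0, noise),
                        math.sin(phase) + rng.gauss(0, noise)))
    return samples


samples = synthetic_swing()
weights = train_ring_som(samples)
# Threshold: the worst quantization error seen on stable data, plus a margin.
threshold = 1.5 * max(best_matching_unit(weights, x)[1] for x in samples)

state, _ = best_matching_unit(weights, samples[10])   # discrete state index
print("state:", state)
print("stable sample flagged:", is_perturbed(weights, samples[10], threshold))
print("far-off sample flagged:", is_perturbed(weights, (5.0, 5.0), threshold))
```

In this sketch the index of the best-matching unit plays the role of the discrete state that a tabular Q-learning agent could consume, and a reading far from every unit is treated as a candidate perturbation. How the paper actually derives its detection signal from the map's clustering and topology preservation is not specified in the abstract.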

Citation

Pages: 3701-3712

Page count: 12
