A Humanoid Robot Learns to Recover Perturbation During Swinging Motion

被引:8
作者
Tran, Duy Hoa [1 ]
Hamker, Fred [1 ]
Nassour, John [1 ]
机构
[1] Tech Univ Chemnitz, Fac Comp Sci, Artificial Intelligence Lab, D-09107 Chemnitz, Germany
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2020年 / 50卷 / 10期
关键词
Neurons; Perturbation methods; Humanoid robots; Legged locomotion; Generators; Switches; Central pattern generator (CPG); fall detection; push recovery; reinforcement learning; self-organizing map (SOM); WALKING PATTERN GENERATION; BIPED WALKING; PUSH RECOVERY; FEATURE-SELECTION; PREVIEW CONTROL; LOCOMOTION; CONTROLLER; MODEL; GAIT; DRIVEN;
D O I
10.1109/TSMC.2018.2884619
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present an approach on fall detection and recovery perturbation during humanoid robot swinging. Reinforcement learning (Q-learning) is employed to explore relationship between actions and states that allow the robot to trigger a reaction to avoid falling. A self-organizing map (SOM) is employed using a circular topological neighborhood function to transform continuous exteroceptive information of the robot during stable swinging into a discrete representation of states. We take advantage of the SOM clustering and topology preservation in the perturbation detection. Swinging and recovery actions are generated from the same neural model using a multilayered multipattern central pattern generator. Experiments, which were carried out in the simulation and on the real humanoid robot (NAO), show that our approach allows humanoid robots to recover from pushing successfully by learning to switch from a rhythmic to an appropriate nonrhythmic behavior.
引用
收藏
页码:3701 / 3712
页数:12
相关论文
共 56 条
  • [1] [Anonymous], 1998, ADAPTIVE COMPUTATION
  • [2] Bellotti F, 2011, PROCEEDINGS OF THE 5TH EUROPEAN CONFERENCE ON GAMES BASED LEARNING, P26
  • [3] Brown TG, 1914, J PHYSIOL-LONDON, V48, P18
  • [4] Toward simple control for complex, autonomous robotic applications: combining discrete and rhythmic motor primitives
    Degallier, Sarah
    Righetti, Ludovic
    Gay, Sebastien
    Ijspeert, Auke
    [J]. AUTONOMOUS ROBOTS, 2011, 31 (2-3) : 155 - 181
  • [5] Online Walking Gait Generation with Adaptive Foot Positioning through Linear Model Predictive Control
    Diedam, Holger
    Dimitrov, Dimitar
    Wieber, Pierre-Brice
    Mombaur, Katja
    Diehl, Moritz
    [J]. 2008 IEEE/RSJ INTERNATIONAL CONFERENCE ON ROBOTS AND INTELLIGENT SYSTEMS, VOLS 1-3, CONFERENCE PROCEEDINGS, 2008, : 1121 - 1126
  • [6] Doppmann C, 2015, IEEE INT CONF ROBOT, P5551, DOI 10.1109/ICRA.2015.7139975
  • [7] Learning CPG-based biped locomotion with a policy gradient method: Application to a humanoid robot
    Endo, Gen
    Morimoto, Jun
    Matsubara, Takamitsu
    Nakanishi, Jun
    Cheng, Gordon
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2008, 27 (02) : 213 - 228
  • [8] Englsberger J, 2011, IEEE INT C INT ROBOT, P4420, DOI 10.1109/IROS.2011.6048045
  • [9] Fast biped walking with a sensor-driven neuronal controller and real-time online learning
    Geng, T
    Porr, B
    Wörgötter, F
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2006, 25 (03) : 243 - 259
  • [10] Machine Learning Capabilities of a Simulated Cerebellum
    Hausknecht, Matthew
    Li, Wen-Ke
    Mauk, Michael
    Stone, Peter
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (03) : 510 - 522