Learning CPG-based biped locomotion with a policy gradient method

被引:0
|
作者
Matsubara, T [1 ]
Morimoto, J [1 ]
Nakanishi, J [1 ]
Sato, M [1 ]
Doya, K [1 ]
机构
[1] Nara Inst Sci & Technol, Nara, Japan
来源
2005 5TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS | 2005年
关键词
reinforcement learning; policy gradient; biped locomotion; central pattern generator; WALKING;
D O I
暂无
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Recently, CPG-based controllers have been widely explored to achieve robust biped locomotion. However, this approach has difficulties in tuning open parameters in the controller. In this paper, we present a learning framework for CPG-based biped locomotion with a policy gradient method. We demonstrate that appropriate sensory feedback in the CPG-based control architecture can he acquired using the proposed method within a thousand trials by numerical simulations. We analyze linear stability of a periodic orbit of the acquired biped walking considering a return map. Furthermore, we apply the learned controllers in numerical simulations to our physical 5-link robot in order to empirically evaluate the effectiveness of the proposed framework. Experimental results suggest the robustness of the acquired controllers against environmental changes and variations in the mass properties of the robot.
引用
收藏
页码:208 / 213
页数:6
相关论文
共 50 条
  • [1] Learning CPG-based biped locomotion with a policy gradient method
    Matsubara, Takamitsu
    Morimoto, Jun
    Nakanishi, Jun
    Sato, Masa-aki
    Doya, Kenji
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2006, 54 (11) : 911 - 920
  • [2] Learning CPG-based biped locomotion with a policy gradient method: Application to a humanoid robot
    Endo, Gen
    Morimoto, Jun
    Matsubara, Takamitsu
    Nakanishi, Jun
    Cheng, Gordon
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2008, 27 (02) : 213 - 228
  • [3] Learning sensory feedback to CPG with policy gradient for biped locomotion
    Matsubara, T
    Morimoto, J
    Nakanishi, J
    Sato, MA
    Doya, K
    2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4, 2005, : 4164 - 4169
  • [4] Biped Locomotion Control through a Biomimetic CPG-based Controller
    Santos, Cristina P.
    Alves, Nuno
    Moreno, Juan C.
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2017, 85 (01) : 47 - 70
  • [5] Biped Locomotion Control through a Biomimetic CPG-based Controller
    Cristina P. Santos
    Nuno Alves
    Juan C. Moreno
    Journal of Intelligent & Robotic Systems, 2017, 85 : 47 - 70
  • [6] A CPG-based control method for the rolling locomotion of a desert spider
    Shi, Ruidong
    Zhang, Xiuli
    Tian, Yaobin
    Dong, Shouyang
    Yao, Yan'an
    2016 IEEE WORKSHOP ON ADVANCED ROBOTICS AND ITS SOCIAL IMPACTS (ARSO), 2016, : 243 - 248
  • [7] A hybrid controller based on CPG and ZMP for biped locomotion
    Massah, Amir B.
    Zamani, Ali
    Salehinia, Yaser
    Aliyari, Mahdi Sh
    Teshnehlab, Mohammad
    JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2013, 27 (11) : 3473 - 3486
  • [8] FPGA implementation of a configurable neuromorphic CPG-based locomotion controller
    Hugo Barron-Zambrano, Jose
    Torres-Huitzil, Cesar
    NEURAL NETWORKS, 2013, 45 : 50 - 61
  • [9] An Approach for Adaptive Limbless Locomotion Using a CPG-Based Reflex Mechanism
    Li, Guoyuan
    Zhang, Houxiang
    Zhang, Jianwei
    Hildre, Hans Petter
    JOURNAL OF BIONIC ENGINEERING, 2014, 11 (03) : 389 - 399
  • [10] A CPG-based Locomotion Control Architecture for Hexapod Robot
    Yu, Haitao
    Guo, Wei
    Deng, Jing
    Li, Mantian
    Cai, Hegao
    2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2013, : 5615 - 5621