Learning CPG-based biped locomotion with a policy gradient method

被引：0

作者：

Matsubara, T ^{[1
]}

Morimoto, J ^{[1
]}

Nakanishi, J ^{[1
]}

Sato, M ^{[1
]}

Doya, K ^{[1
]}

机构：

[1] Nara Inst Sci & Technol, Nara, Japan

来源：

2005 5TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS | 2005年

关键词：

reinforcement learning; policy gradient; biped locomotion; central pattern generator; WALKING;

D O I：

暂无

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Recently, CPG-based controllers have been widely explored to achieve robust biped locomotion. However, this approach has difficulties in tuning open parameters in the controller. In this paper, we present a learning framework for CPG-based biped locomotion with a policy gradient method. We demonstrate that appropriate sensory feedback in the CPG-based control architecture can he acquired using the proposed method within a thousand trials by numerical simulations. We analyze linear stability of a periodic orbit of the acquired biped walking considering a return map. Furthermore, we apply the learned controllers in numerical simulations to our physical 5-link robot in order to empirically evaluate the effectiveness of the proposed framework. Experimental results suggest the robustness of the acquired controllers against environmental changes and variations in the mass properties of the robot.

引用

页码：208 / 213

页数：6

共 50 条

[1] Learning CPG-based biped locomotion with a policy gradient method
Matsubara, Takamitsu
Morimoto, Jun
Nakanishi, Jun
Sato, Masa-aki
Doya, Kenji
ROBOTICS AND AUTONOMOUS SYSTEMS, 2006, 54 (11) : 911 - 920
[2] Learning CPG-based biped locomotion with a policy gradient method: Application to a humanoid robot
Endo, Gen
Morimoto, Jun
Matsubara, Takamitsu
Nakanishi, Jun
Cheng, Gordon
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2008, 27 (02) : 213 - 228
[3] Learning sensory feedback to CPG with policy gradient for biped locomotion
Matsubara, T
Morimoto, J
Nakanishi, J
Sato, MA
Doya, K
2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4, 2005, : 4164 - 4169
[4] Biped Locomotion Control through a Biomimetic CPG-based Controller
Santos, Cristina P.
Alves, Nuno
Moreno, Juan C.
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2017, 85 (01) : 47 - 70
[5] Biped Locomotion Control through a Biomimetic CPG-based Controller
Cristina P. Santos
Nuno Alves
Juan C. Moreno
Journal of Intelligent & Robotic Systems, 2017, 85 : 47 - 70
[6] A CPG-based control method for the rolling locomotion of a desert spider
Shi, Ruidong
Zhang, Xiuli
Tian, Yaobin
Dong, Shouyang
Yao, Yan'an
2016 IEEE WORKSHOP ON ADVANCED ROBOTICS AND ITS SOCIAL IMPACTS (ARSO), 2016, : 243 - 248
[7] A hybrid controller based on CPG and ZMP for biped locomotion
Massah, Amir B.
Zamani, Ali
Salehinia, Yaser
Aliyari, Mahdi Sh
Teshnehlab, Mohammad
JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2013, 27 (11) : 3473 - 3486
[8] FPGA implementation of a configurable neuromorphic CPG-based locomotion controller
Hugo Barron-Zambrano, Jose
Torres-Huitzil, Cesar
NEURAL NETWORKS, 2013, 45 : 50 - 61
[9] An Approach for Adaptive Limbless Locomotion Using a CPG-Based Reflex Mechanism
Li, Guoyuan
Zhang, Houxiang
Zhang, Jianwei
Hildre, Hans Petter
JOURNAL OF BIONIC ENGINEERING, 2014, 11 (03) : 389 - 399
[10] A CPG-based Locomotion Control Architecture for Hexapod Robot
Yu, Haitao
Guo, Wei
Deng, Jing
Li, Mantian
Cai, Hegao
2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2013, : 5615 - 5621

← 1 2 3 4 5 →