Adaptive Gait Generation for Hexapod Robots Based on Reinforcement Learning and Hierarchical Framework

被引:7
作者
Qiu, Zhiying [1 ]
Wei, Wu [1 ,2 ]
Liu, Xiongding [1 ]
机构
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China
[2] South China Univ Technol, Unmanned Aerial Vehicle Syst Engn Technol Res Ctr, Key Lab Autonomous Syst & Networked Control, Minist Educ, Guangzhou 510641, Peoples R China
基金
中国国家自然科学基金;
关键词
hexapod robot; reinforcement learning; hierarchical framework; gait generation; ENVIRONMENT;
D O I
10.3390/act12020075
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Gait plays a decisive role in the performance of hexapod robot walking; this paper focuses on adaptive gait generation with reinforcement learning for a hexapod robot. Moreover, the hexapod robot has a high-dimensional action space and therefore it is a great challenge to use reinforcement learning to directly train the robot's joint angles. As a result, a hierarchical and modular framework and learning details are proposed in this paper, using only seven-dimensional vectors to denote the agent actions. In addition, we conduct experiments and deploy the proposed framework using a real hexapod robot. The experimental results show that superior reinforcement learning algorithms can converge in our framework, such as SAC, PPO, DDPG and TD3. Specifically, the gait policy trained in our framework can generate new adaptive hexapod gait on flat terrain, which is stable and has lower transportation cost than rhythmic gaits.
引用
收藏
页数:15
相关论文
共 39 条
[31]   Circus ANYmal: A Quadruped Learning Dexterous Manipulation with Its Limbs [J].
Shi, Fan ;
Homberger, Timon ;
Lee, Joonho ;
Miki, Takahiro ;
Zhao, Moju ;
Farshidian, Farbod ;
Okada, Kei ;
Inaba, Masayuki ;
Hutter, Marco .
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, :2316-2323
[32]   Towards dynamic alternating tripod trotting of a pony-sized hexapod robot for disaster rescuing based on multi-modal impedance control [J].
Sun, Qiao ;
Gao, Feng ;
Chen, Xianbao .
ROBOTICA, 2018, 36 (07) :1048-1076
[33]  
Tan J, 2018, Arxiv, DOI arXiv:1804.10332
[34]   An intelligent hexapod robot for inspection of airframe components oriented by deep learning technique [J].
Teixeira Vivaldini, Kelen C. ;
Franco Barbosa, Gustavo ;
Santos, Igor Araujo Dias ;
Kim, Pedro H. C. ;
McMichael, Grayson ;
Guerra-Zubiaga, David A. .
JOURNAL OF THE BRAZILIAN SOCIETY OF MECHANICAL SCIENCES AND ENGINEERING, 2021, 43 (11)
[35]   Versatile modular neural locomotion control with fast learning [J].
Thor, Mathias ;
Manoonpong, Poramate .
NATURE MACHINE INTELLIGENCE, 2022, 4 (02) :169-179
[36]   DeepGait: Planning and Control of Quadrupedal Gaits Using Deep Reinforcement Learning [J].
Tsounis, Vassilios ;
Alge, Mitja ;
Lee, Joonho ;
Farshidian, Farbod ;
Hutter, Marco .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :3699-3706
[37]   Contact Sequence Planning for Hexapod Robots in Sparse Foothold Environment Based on Monte-Carlo Tree [J].
Xu, Peng ;
Ding, Liang ;
Wang, Zhikai ;
Gao, Haibo ;
Zhou, Ruyi ;
Gong, Zhaopei ;
Liu, Guangjun .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) :826-833
[38]   Proximal Policy Optimization with Mixed Distributed Training [J].
Zhang, Zhenyu ;
Luo, Xiangfeng ;
Liu, Tong ;
Xie, Shaorong ;
Wang, Jianshu ;
Wang, Wei ;
Li, Yang ;
Peng, Yan .
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, :1452-1456
[39]  
Zhongbin Cai, 2021, Journal of Physics: Conference Series, V1754, DOI [10.1088/1742-6596/1754/1/012157, 10.1088/1742-6596/1754/1/012157]