Adaptive Gait Generation for Hexapod Robots Based on Reinforcement Learning and Hierarchical Framework

Cited by: 7

Authors:

Qiu, Zhiying [1]

Wei, Wu [1,2]

Liu, Xiongding [1]

Affiliations:

[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China

[2] South China Univ Technol, Unmanned Aerial Vehicle Syst Engn Technol Res Ctr, Key Lab Autonomous Syst & Networked Control, Minist Educ, Guangzhou 510641, Peoples R China

Source:

ACTUATORS | 2023 / Vol. 12 / Issue 02

Funding:

National Natural Science Foundation of China;

Keywords:

hexapod robot; reinforcement learning; hierarchical framework; gait generation; ENVIRONMENT;

DOI:

10.3390/act12020075

Chinese Library Classification (CLC) number:

TH [Machinery and Instrument Industry];

Discipline classification code:

0802;

Abstract:

Gait plays a decisive role in the walking performance of a hexapod robot. This paper focuses on adaptive gait generation for a hexapod robot using reinforcement learning. Because the hexapod robot has a high-dimensional action space, directly training its joint angles with reinforcement learning is a great challenge. This paper therefore proposes a hierarchical and modular framework, together with its learning details, in which the agent's actions are represented by only seven-dimensional vectors. We also conduct experiments and deploy the proposed framework on a real hexapod robot. The experimental results show that advanced reinforcement learning algorithms, such as SAC, PPO, DDPG and TD3, can converge in our framework. Specifically, the gait policy trained in our framework can generate a new adaptive hexapod gait on flat terrain, which is stable and has a lower transportation cost than rhythmic gaits.

Pages: 15

References: 39 in total
