Learning Quadrupedal High-Speed Running on Uneven Terrain

被引：1

作者：

Han, Xinyu ^{[1
]}

Zhao, Mingguo ^{[2
]}

Chen, Xuechao

Ma, Gan

机构：

[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

[2] Tsinghua Univ, Beijing Innovat Ctr Future Chips, Beijing 100084, Peoples R China

来源：

BIOMIMETICS | 2024年 / 9卷 / 01期

关键词：

reinforcement learning; quadrupedal robot; high-speed locomotion;

D O I：

10.3390/biomimetics9010037

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Reinforcement learning (RL)-based controllers have been applied to the high-speed movement of quadruped robots on uneven terrains. The external disturbances increase as the robot moves faster on such terrains, affecting the stability of the robot. Many existing RL-based methods adopt higher control frequencies to respond quickly to the disturbance, which requires a significant computational cost. We propose a control framework that consists of an RL-based control policy updating at a low frequency and a model-based joint controller updating at a high frequency. Unlike previous methods, our policy outputs the control law for each joint, executed by the corresponding high-frequency joint controller to reduce the impact of external disturbances on the robot. We evaluated our method on various simulated terrains with height differences of up to 6 cm. We achieved a running motion of 1.8 m/s in the simulation using the Unitree A1 quadruped. The RL-based control policy updates at 50 Hz with a latency of 20 ms, while the model-based joint controller runs at 1000 Hz. The experimental results show that the proposed framework can overcome the latency caused by low-frequency updates, making it applicable for real-robot deployment.

引用

页数：14

共 31 条

[1]

[Anonymous], 2023, Unitree A1

[2]

Bloesch M., 2013, P ROB SCI SYST, V17, P17, DOI DOI 10.7551/MITPRESS/9816.003.0008

[3] Trajectory Optimization for Legged Robots With Slipping Motions [J].

Carius, Jan ;

Ranftl, Rene ;

Koltun, Vladlen ;

Hutter, Marco .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03) :3013-3020

[4] Learning quadrupedal locomotion on deformable terrain [J].

Choi, Suyoung ;

Ji, Gwanghyeon ;

Park, Jeongsoo ;

Kim, Hyeongjun ;

Mun, Juhyeok ;

Lee, Jeong Hyun ;

Hwangbo, Jemin .

SCIENCE ROBOTICS, 2023, 8 (74)

[5]

Clevert DA, 2016, Arxiv, DOI [arXiv:1511.07289, DOI 10.48550/ARXIV.1511.07289]

[6]

Dao J, 2022, Arxiv, DOI arXiv:2204.04340

[7]

Di Carlo J, 2018, IEEE INT C INT ROBOT, P7440, DOI 10.1109/IROS.2018.8594448

[8] Learning Dynamic Bipedal Walking Across Stepping Stones [J].

Duan, Helei ;

Malik, Ashish ;

Gadde, Mohitvishnu S. ;

Dao, Jeremy ;

Fern, Alan ;

Hurst, Jonathan .

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, :6746-6752

[9]

Haserbek Tastulek, 2022, 2022 IEEE International Conference on Real-time Computing and Robotics (RCAR), P699, DOI 10.1109/RCAR54675.2022.9872190

[10] High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning [J].

Jin, Yongbin ;

Liu, Xianwei ;

Shao, Yecheng ;

Wang, Hongtao ;

Yang, Wei .

NATURE MACHINE INTELLIGENCE, 2022, 4 (12) :1198-1208

← 1 2 3 4 →