Learning Quadrupedal High-Speed Running on Uneven Terrain

被引:1
作者
Han, Xinyu [1 ]
Zhao, Mingguo [2 ]
Chen, Xuechao
Ma, Gan
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Beijing Innovat Ctr Future Chips, Beijing 100084, Peoples R China
关键词
reinforcement learning; quadrupedal robot; high-speed locomotion;
D O I
10.3390/biomimetics9010037
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Reinforcement learning (RL)-based controllers have been applied to the high-speed movement of quadruped robots on uneven terrains. The external disturbances increase as the robot moves faster on such terrains, affecting the stability of the robot. Many existing RL-based methods adopt higher control frequencies to respond quickly to the disturbance, which requires a significant computational cost. We propose a control framework that consists of an RL-based control policy updating at a low frequency and a model-based joint controller updating at a high frequency. Unlike previous methods, our policy outputs the control law for each joint, executed by the corresponding high-frequency joint controller to reduce the impact of external disturbances on the robot. We evaluated our method on various simulated terrains with height differences of up to 6 cm. We achieved a running motion of 1.8 m/s in the simulation using the Unitree A1 quadruped. The RL-based control policy updates at 50 Hz with a latency of 20 ms, while the model-based joint controller runs at 1000 Hz. The experimental results show that the proposed framework can overcome the latency caused by low-frequency updates, making it applicable for real-robot deployment.
引用
收藏
页数:14
相关论文
共 31 条
[1]  
[Anonymous], 2023, Unitree A1
[2]  
Bloesch M., 2013, P ROB SCI SYST, V17, P17, DOI DOI 10.7551/MITPRESS/9816.003.0008
[3]   Trajectory Optimization for Legged Robots With Slipping Motions [J].
Carius, Jan ;
Ranftl, Rene ;
Koltun, Vladlen ;
Hutter, Marco .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03) :3013-3020
[4]   Learning quadrupedal locomotion on deformable terrain [J].
Choi, Suyoung ;
Ji, Gwanghyeon ;
Park, Jeongsoo ;
Kim, Hyeongjun ;
Mun, Juhyeok ;
Lee, Jeong Hyun ;
Hwangbo, Jemin .
SCIENCE ROBOTICS, 2023, 8 (74)
[5]  
Clevert DA, 2016, Arxiv, DOI [arXiv:1511.07289, DOI 10.48550/ARXIV.1511.07289]
[6]  
Dao J, 2022, Arxiv, DOI arXiv:2204.04340
[7]  
Di Carlo J, 2018, IEEE INT C INT ROBOT, P7440, DOI 10.1109/IROS.2018.8594448
[8]   Learning Dynamic Bipedal Walking Across Stepping Stones [J].
Duan, Helei ;
Malik, Ashish ;
Gadde, Mohitvishnu S. ;
Dao, Jeremy ;
Fern, Alan ;
Hurst, Jonathan .
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, :6746-6752
[9]  
Haserbek Tastulek, 2022, 2022 IEEE International Conference on Real-time Computing and Robotics (RCAR), P699, DOI 10.1109/RCAR54675.2022.9872190
[10]   High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning [J].
Jin, Yongbin ;
Liu, Xianwei ;
Shao, Yecheng ;
Wang, Hongtao ;
Yang, Wei .
NATURE MACHINE INTELLIGENCE, 2022, 4 (12) :1198-1208