Learning quadrupedal locomotion on tough terrain using an asymmetric terrain feature mining network

被引:0
作者
Zuo, Guoyu [1 ,2 ]
Wang, Yong [1 ,2 ]
Gong, Daoxiong [1 ,2 ]
Yu, Shuangyue [1 ,2 ]
机构
[1] Beijing Key Lab Comp Intelligence & Intelligent Sy, Beijing 100124, Peoples R China
[2] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
关键词
Quadruped robot; Robot learning; Motion stability; Tough terrain; Terrain feature mining network; GROUND SEGMENTATION; ROBOTS;
D O I
10.1007/s10489-024-05782-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The development of robust and agile locomotion skills for legged robots using reinforcement learning is challenging, particularly in demanding environments. In this study, we propose a blind locomotion control learning framework that enables fast and stable walking on challenging terrains. First, we construct an asymmetric terrain feature extraction network that uses a multilayer perceptron to effectively infer terrain features from the history of proprioceptive states, consisting only of inertial measurement unit and joint encoder data. Additionally, our asymmetric actor-critic framework implicitly infers terrain features, thereby enhancing the accuracy of terrain representation. Second, we introduce a foot trajectory generator based on prior gait behaviors, which improves the gait periodicity and provides accurate state information for terrain feature inference. Compared to state-of-the-art methods, our approach significantly increases the learning efficiency by 26.0% and enhances terrain adaptation by 5.0%. It also achieved a more periodic gait, with the state-command tracking error reduced by 38.5% compared with advanced methods. The success rate for traversing complex terrains was similar to that of the baseline methods, with a 31.3% increase in the step height on stair-like terrains. The experimental results demonstrate that the proposed method enables fast and stable walking on challenging terrains.
引用
收藏
页码:11547 / 11563
页数:17
相关论文
共 40 条
[1]  
Agarwal A., 2023, P C ROB LEARN, P403
[2]   Learning dexterous in-hand manipulation [J].
Andrychowicz, Marcin ;
Baker, Bowen ;
Chociej, Maciek ;
Jozefowicz, Rafal ;
McGrew, Bob ;
Pachocki, Jakub ;
Petron, Arthur ;
Plappert, Matthias ;
Powell, Glenn ;
Ray, Alex ;
Schneider, Jonas ;
Sidor, Szymon ;
Tobin, Josh ;
Welinder, Peter ;
Weng, Lilian ;
Zaremba, Wojciech .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (01) :3-20
[3]   Development of quadruped walking robots: A review [J].
Biswal, Priyaranjan ;
Mohanty, Prases K. .
AIN SHAMS ENGINEERING JOURNAL, 2021, 12 (02) :2017-2031
[4]   Learning One-Shot Imitation From Humans Without Humans [J].
Bonardi, Alessandro ;
James, Stephen ;
Davison, Andrew J. .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :3533-3539
[5]   Learning quadrupedal locomotion on deformable terrain [J].
Choi, Suyoung ;
Ji, Gwanghyeon ;
Park, Jeongsoo ;
Kim, Hyeongjun ;
Mun, Juhyeok ;
Lee, Jeong Hyun ;
Hwangbo, Jemin .
SCIENCE ROBOTICS, 2023, 8 (74)
[6]  
Ding YR, 2019, IEEE INT CONF ROBOT, P8484, DOI [10.1109/ICRA.2019.8793669, 10.1109/icra.2019.8793669]
[7]   Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions [J].
Escontrela, Alejandro ;
Peng, Xue Bin ;
Yu, Wenhao ;
Zhang, Tingnan ;
Iscen, Atil ;
Goldberg, Ken ;
Abbeel, Pieter .
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, :25-32
[8]   STANCE: Locomotion Adaptation Over Soft Terrain [J].
Fahmi, Shamel ;
Focchi, Michele ;
Radulescu, Andreea ;
Fink, Geoff ;
Barasuol, Victor ;
Semini, Claudio .
IEEE TRANSACTIONS ON ROBOTICS, 2020, 36 (02) :443-457
[9]  
Fu Z., 2022, C ROB LEARN, P928
[10]  
Gehring C., 2021, FIELD SERVICE ROBOTI, P247, DOI DOI 10.1007/978-981-15-9460-1_18