Controlling the Solo12 quadruped robot with deep reinforcement learning

被引:17
作者
Aractingi, Michel [1 ,2 ]
Leziart, Pierre-Alexandre [1 ]
Flayols, Thomas [1 ]
Perez, Julien [2 ]
Silander, Tomi [2 ]
Soueres, Philippe [1 ]
机构
[1] Univ Toulouse, CNRS, LAAS, F-31400 Toulouse, France
[2] NAVER LABS Europe, F-38240 Meylan, France
关键词
LOCOMOTION;
D O I
10.1038/s41598-023-38259-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Quadruped robots require robust and general locomotion skills to exploit their mobility potential in complex and challenging environments. In this work, we present an implementation of a robust end-to-end learning-based controller on the Solo12 quadruped. Our method is based on deep reinforcement learning of joint impedance references. The resulting control policies follow a commanded velocity reference while being efficient in its energy consumption and easy to deploy. We detail the learning procedure and method for transfer on the real robot. We show elaborate experiments. Finally, we present experimental results of the learned locomotion on various grounds indoors and outdoors. These results show that the Solo12 robot is a suitable open-source platform for research combining learning and control because of the easiness in transferring and deploying learned controllers.
引用
收藏
页数:12
相关论文
共 45 条
[1]  
Bellicoso C. Dario, 2018, IEEE Robotics and Automation Letters, V3, P2261, DOI [10.1109/lra.2018.2794620, 10.1109/LRA.2018.2794620]
[2]   Biped dynamic walking using reinforcement learning [J].
Benbrahim, H ;
Franklin, JA .
ROBOTICS AND AUTONOMOUS SYSTEMS, 1997, 22 (3-4) :283-302
[3]  
Bengio Y., 2009, P 26 ANN INT C MACHI, P41, DOI DOI 10.1145/1553374.1553380
[4]  
Di Carlo J, 2018, IEEE INT C INT ROBOT, P7440, DOI 10.1109/IROS.2018.8594448
[5]  
Dynamics, SPOT MIN
[6]  
Fadini G., 2021, IEEE INT C ROB AUT I
[7]   Catastrophic forgetting in connectionist networks [J].
French, RM .
TRENDS IN COGNITIVE SCIENCES, 1999, 3 (04) :128-135
[8]  
Fuchioka Y, 2022, Arxiv, DOI [arXiv:2210.01247, 10.48550/ARXIV.2210.01247]
[9]   An Open Torque-Controlled Modular Robot Architecture for Legged Locomotion Research [J].
Grimminger, Felix ;
Meduri, Avadesh ;
Khadiv, Majid ;
Viereck, Julian ;
Wuthrich, Manuel ;
Naveau, Maximilien ;
Berenz, Vincent ;
Heim, Steve ;
Widmaier, Felix ;
Flayols, Thomas ;
Fiene, Jonathan ;
Badri-Sproewitz, Alexander ;
Righetti, Ludovic .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :3650-3657
[10]   SKILLFUL CONTROL UNDER UNCERTAINTY VIA DIRECT REINFORCEMENT LEARNING [J].
GULLAPALLI, V .
ROBOTICS AND AUTONOMOUS SYSTEMS, 1995, 15 (04) :237-246