Reinforcement Learning for Quadrupedal Locomotion: Current Advancements and Future Perspectives

Times cited: 0

Authors:

Gurram, Maurya [1]

Uttam, Prakash Kumar [2]

Ohol, Shantipal S. [3]

Affiliations:

[1] Vellore Inst Technol AP, Comp Sci & Engn, Amaravati, India

[2] Indian Inst Sci, Elect Elect & Comp Sci, Bangalore, Karnataka, India

[3] COEP Univ, Pune, Maharashtra, India

Source:

2025 9TH INTERNATIONAL CONFERENCE ON MECHANICAL ENGINEERING AND ROBOTICS RESEARCH, ICMERR | 2025

Keywords:

Reinforcement Learning; Control Theory; Quadrupedal Locomotion

DOI:

10.1109/ICMERR64601.2025.10949908

Chinese Library Classification (CLC) number:

TH [Machinery and Instrument Industry]

Discipline classification code:

0802

Abstract:

In recent years, reinforcement learning (RL) based quadrupedal locomotion control has emerged as an extensively researched field, driven by the potential advantages of autonomous learning and adaptation compared to traditional control methods. This paper provides a comprehensive study of the latest research in applying RL techniques to develop locomotion controllers for quadrupedal robots. We present a detailed overview of the core concepts, methodologies, and key advancements in RL-based locomotion controllers, including learning algorithms, training curricula, reward formulations, and simulation-to-real transfer techniques. The study covers both gait-bound and gait-free approaches, highlighting their respective strengths and limitations. Additionally, we discuss the integration of these controllers with robotic hardware and the role of sensor feedback in enabling adaptive behavior. The paper also outlines future research directions, such as incorporating exteroceptive sensing, combining model-based and model-free techniques, and developing online learning capabilities. Our study aims to provide researchers and practitioners with a comprehensive understanding of the state of the art in RL-based locomotion controllers, enabling them to build upon existing work and explore novel solutions for enhancing the mobility and adaptability of quadrupedal robots in real-world environments.

Pages: 28-38

Page count: 11

