Reinforcement Learning-based Hierarchical Control for Path Following of a Salamander-like Robot

被引:4
|
作者
Zhang, Xueyou [1 ]
Guo, Xian [1 ]
Fang, Yongchun [1 ]
Zhu, Wei [1 ]
机构
[1] Nankai Univ, Coll Artificial Intelligence, Inst Robot & Automat Informat Syst, Tianjin, Peoples R China
来源
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2020年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/IROS45743.2020.9341656
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Path following is a challenging task for legged robots. In this paper, we present a hierarchical control architecture for path following of a quadruped salamander-like robot, in which, the tracking problem is decomposed into two sub-tasks: high-level policy learning based on the framework of reinforcement learning (RL) and low-level traditional controller design. More specifically, the high-level policy is learned in a physics simulator with a low-level controller designed in advance. To improve the tracking accuracy and to eliminate static errors, a soft Actor-Critic algorithm with state integral compensation is proposed. Additionally, to enhance the generalization and transferability, a compact state representation, which only contains the information of the target path and the abstract action similar to front-back and left-right, is proposed. The proposed algorithm is trained offline in the simulation environment and tested on the self-developed real quadruped salamander-like robot for different path following tasks. Simulation and experiments results validate the satisfactory performance of the proposed method.
引用
收藏
页码:6077 / 6083
页数:7
相关论文
共 50 条
  • [1] Reinforcement Learning-based Path Following Control for a Vehicle with Variable Delay in the Drivetrain
    Ultsch, Johannes
    Mirwald, Jonas
    Brembeck, Jonathan
    de Castro, Ricardo
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 532 - 539
  • [2] A Novel Design Methodology of CPG Model for a Salamander-Like Robot
    Liu, Yang
    Liu, Zhiang
    Fang, Yongchun
    Liu, Huawang
    Guo, Xian
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6115 - 6122
  • [3] Coordinated Body Bending Improves Performance of a Salamander-like Robot
    Ozkan Aydin, Y.
    Rieser, J. M.
    Gong, C.
    Michel, K.
    Rankin, J.
    Nicieza, A. G.
    Hutchinson, J. R.
    Choset, H.
    Goldman, D., I
    INTEGRATIVE AND COMPARATIVE BIOLOGY, 2017, 57 : E372 - E372
  • [4] Reinforcement Learning-Based High-Speed Path Following Control for Autonomous Vehicles
    Liu, Jia
    Cui, Yunduan
    Duan, Jianghua
    Jiang, Zhengmin
    Pan, Zhongming
    Xu, Kun
    Li, Huiyun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) : 7603 - 7615
  • [5] Interfacing a Salamander Brain with a Salamander-like Robot: Control of Speed and Direction with Calcium Signals from Brainstem Reticulospinal Neurons
    Ryczko, Dimitri
    Thandiackal, Robin
    Ijspeert, Auke J.
    2016 6TH IEEE INTERNATIONAL CONFERENCE ON BIOMEDICAL ROBOTICS AND BIOMECHATRONICS (BIOROB), 2016, : 1140 - 1147
  • [6] Hybrid learning-based visual path following for an industrial robot
    Bingol, Mustafa Can
    Aydogmus, Omur
    ROBOTICA, 2024, 42 (11) : 3888 - 3903
  • [7] Mobile robot navigation control based on hierarchical reinforcement learning
    Department of Automation, University of Science and Technology of China, Hefei 230027, China
    Nanjing Hangkong Hangtian Daxue Xuebao, 2006, 1 (70-75):
  • [8] Reinforcement Learning-Based Tracking Control For Wheeled Mobile Robot
    Nguyen Tan Luy
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 462 - 467
  • [9] Deep reinforcement learning-based pitch attitude control of a beaver-like underwater robot
    Chen, Gang
    Zhao, Zhihan
    Lu, Yuwang
    Yang, Chenguang
    Hu, Huosheng
    OCEAN ENGINEERING, 2024, 307
  • [10] Reinforcement Learning-Based Path following Control with Dynamics Randomization for Parametric Uncertainties in Autonomous Driving
    Ahmic, Kenan
    Ultsch, Johannes
    Brembeck, Jonathan
    Winter, Christoph
    APPLIED SCIENCES-BASEL, 2023, 13 (06):