Offline Reinforcement Learning of Robotic Control Using Deep Kinematics and Dynamics

被引:3
作者
Li, Xiang [1 ]
Shang, Weiwei [1 ]
Cong, Shuang [1 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Peoples R China
基金
中国国家自然科学基金;
关键词
Computed-torque controller; kinematic and dynamic model learning; model-based reinforcement learning (MBRL); robotic control; trajectory tracking; NEURAL-NETWORKS;
D O I
10.1109/TMECH.2023.3336316
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of deep learning, model-free reinforcement learning algorithms have achieved remarkable results in many fields. However, their high sample complexity and the potential for causing damage to environments and robots pose severe challenges for their application in real-world environments. Model-based reinforcement learning algorithms are often used to reduce the sample complexity. One limitation of these algorithms is the inevitable modeling errors. While the black-box model can fit complex state transition models, it ignores the existing knowledge of physics and robotics, especially studies of kinematic and dynamic models of the robotic manipulator. Compared with the black-box model, the physics-inspired deep models do not require specific knowledge of each system to obtain interpretable kinematic and dynamic models. In model-based reinforcement learning, these models can simulate the motion and be combined with classical controllers. This is due to their sharing the same form as traditional models, leading to higher precision tracking results. In this work, we utilize physics-inspired deep models to learn the kinematics and dynamics of a robotic manipulator. We propose a model-based offline reinforcement learning algorithm for controller parameter learning, combined with the traditional computed-torque controller. Experiments on trajectory tracking control of the Baxter manipulator, both in joint and operational space, are conducted in simulation and real environments. Experimental results demonstrate that our algorithm can significantly improve tracking accuracy and exhibits strong generalization and robustness.
引用
收藏
页码:2428 / 2439
页数:12
相关论文
共 34 条
  • [1] [Anonymous], 2020, Robot modeling and control
  • [2] Chen R. T., 2018, Advances in Neural Information Processing Systems, V31, P6572
  • [3] Chua K, 2018, 32 C NEURAL INFORM P
  • [4] Clavera I., 2018, C ROBOT LEARNING, P617
  • [5] Cranmer M., 2020, P INT C LEARN REPR, P1
  • [6] Deisenroth M., 2011, ICML
  • [7] Deisenroth MP, 2012, ROBOTICS: SCIENCE AND SYSTEMS VII, P57
  • [8] A General Approach for the Automation of Hydraulic Excavator Arms Using Reinforcement Learning
    Egli, Pascal
    Hutter, Marco
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02): : 5679 - 5686
  • [9] Fang GX, 2022, IEEE-ASME T MECH, V27, P5296, DOI [10.1109/TMECH.2022.3178303, 10.1109/IECON49645.2022.9968639]
  • [10] Finzi M., 2020, Advances in Neural Information Processing Systems, V33