Offline Reinforcement Learning of Robotic Control Using Deep Kinematics and Dynamics

被引:3
作者
Li, Xiang [1 ]
Shang, Weiwei [1 ]
Cong, Shuang [1 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Peoples R China
基金
中国国家自然科学基金;
关键词
Computed-torque controller; kinematic and dynamic model learning; model-based reinforcement learning (MBRL); robotic control; trajectory tracking; NEURAL-NETWORKS;
D O I
10.1109/TMECH.2023.3336316
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of deep learning, model-free reinforcement learning algorithms have achieved remarkable results in many fields. However, their high sample complexity and the potential for causing damage to environments and robots pose severe challenges for their application in real-world environments. Model-based reinforcement learning algorithms are often used to reduce the sample complexity. One limitation of these algorithms is the inevitable modeling errors. While the black-box model can fit complex state transition models, it ignores the existing knowledge of physics and robotics, especially studies of kinematic and dynamic models of the robotic manipulator. Compared with the black-box model, the physics-inspired deep models do not require specific knowledge of each system to obtain interpretable kinematic and dynamic models. In model-based reinforcement learning, these models can simulate the motion and be combined with classical controllers. This is due to their sharing the same form as traditional models, leading to higher precision tracking results. In this work, we utilize physics-inspired deep models to learn the kinematics and dynamics of a robotic manipulator. We propose a model-based offline reinforcement learning algorithm for controller parameter learning, combined with the traditional computed-torque controller. Experiments on trajectory tracking control of the Baxter manipulator, both in joint and operational space, are conducted in simulation and real environments. Experimental results demonstrate that our algorithm can significantly improve tracking accuracy and exhibits strong generalization and robustness.
引用
收藏
页码:2428 / 2439
页数:12
相关论文
共 34 条
  • [21] Lutter M, 2019, IEEE INT C INT ROBOT, P7718, DOI [10.1109/IROS40897.2019.8968268, 10.1109/iros40897.2019.8968268]
  • [22] Martín-Martín R, 2019, IEEE INT C INT ROBOT, P1010, DOI [10.1109/iros40897.2019.8968201, 10.1109/IROS40897.2019.8968201]
  • [23] Human-level control through deep reinforcement learning
    Mnih, Volodymyr
    Kavukcuoglu, Koray
    Silver, David
    Rusu, Andrei A.
    Veness, Joel
    Bellemare, Marc G.
    Graves, Alex
    Riedmiller, Martin
    Fidjeland, Andreas K.
    Ostrovski, Georg
    Petersen, Stig
    Beattie, Charles
    Sadik, Amir
    Antonoglou, Ioannis
    King, Helen
    Kumaran, Dharshan
    Wierstra, Daan
    Legg, Shane
    Hassabis, Demis
    [J]. NATURE, 2015, 518 (7540) : 529 - 533
  • [24] Nagabandi A, 2018, IEEE INT CONF ROBOT, P7579
  • [25] Operational space control: A theoretical and empirical comparison
    Nakanishi, Jun
    Cory, Rick
    Mistry, Michael
    Peters, Jan
    Schaal, Stefan
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2008, 27 (06) : 737 - 757
  • [26] Analytic Deep Neural Network-Based Robot Control
    Nguyen, Huu-Thiet
    Cheah, Chien Chern
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (04) : 2176 - 2184
  • [27] Real-Robot Deep Reinforcement Learning: Improving Trajectory Tracking of Flexible-Joint Manipulator with Reference Correction
    Pavlichenko, Dmytro
    Behnke, Sven
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2671 - 2677
  • [28] Analyzing Neural Jacobian Methods in Applications of Visual Servoing and Kinematic Control
    Przystupa, Michael
    Dehghan, Masood
    Jagersand, Martin
    Mahmood, A. Rupam
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 14276 - 14283
  • [29] Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations
    Raissi, M.
    Perdikaris, P.
    Karniadakis, G. E.
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2019, 378 : 686 - 707
  • [30] Rueckert E, 2017, 2017 IEEE-RAS 17TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTICS (HUMANOIDS), P811, DOI 10.1109/HUMANOIDS.2017.8246965