Offline Reinforcement Learning of Robotic Control Using Deep Kinematics and Dynamics

被引：3

作者：

Li, Xiang ^{[1
]}

Shang, Weiwei ^{[1
]}

Cong, Shuang ^{[1
]}

机构：

[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Peoples R China

来源：

IEEE-ASME TRANSACTIONS ON MECHATRONICS | 2024年 / 29卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Computed-torque controller; kinematic and dynamic model learning; model-based reinforcement learning (MBRL); robotic control; trajectory tracking; NEURAL-NETWORKS;

D O I：

10.1109/TMECH.2023.3336316

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the rapid development of deep learning, model-free reinforcement learning algorithms have achieved remarkable results in many fields. However, their high sample complexity and the potential for causing damage to environments and robots pose severe challenges for their application in real-world environments. Model-based reinforcement learning algorithms are often used to reduce the sample complexity. One limitation of these algorithms is the inevitable modeling errors. While the black-box model can fit complex state transition models, it ignores the existing knowledge of physics and robotics, especially studies of kinematic and dynamic models of the robotic manipulator. Compared with the black-box model, the physics-inspired deep models do not require specific knowledge of each system to obtain interpretable kinematic and dynamic models. In model-based reinforcement learning, these models can simulate the motion and be combined with classical controllers. This is due to their sharing the same form as traditional models, leading to higher precision tracking results. In this work, we utilize physics-inspired deep models to learn the kinematics and dynamics of a robotic manipulator. We propose a model-based offline reinforcement learning algorithm for controller parameter learning, combined with the traditional computed-torque controller. Experiments on trajectory tracking control of the Baxter manipulator, both in joint and operational space, are conducted in simulation and real environments. Experimental results demonstrate that our algorithm can significantly improve tracking accuracy and exhibits strong generalization and robustness.

引用

页码：2428 / 2439

页数：12

共 34 条

[21] Lutter M, 2019, IEEE INT C INT ROBOT, P7718, DOI [10.1109/IROS40897.2019.8968268, 10.1109/iros40897.2019.8968268]
[22] Martín-Martín R, 2019, IEEE INT C INT ROBOT, P1010, DOI [10.1109/iros40897.2019.8968201, 10.1109/IROS40897.2019.8968201]
[23] Human-level control through deep reinforcement learning
Mnih, Volodymyr
Kavukcuoglu, Koray
Silver, David
Rusu, Andrei A.
Veness, Joel
Bellemare, Marc G.
Graves, Alex
Riedmiller, Martin
Fidjeland, Andreas K.
Ostrovski, Georg
Petersen, Stig
Beattie, Charles
Sadik, Amir
Antonoglou, Ioannis
King, Helen
Kumaran, Dharshan
Wierstra, Daan
Legg, Shane
Hassabis, Demis
[J]. NATURE, 2015, 518 (7540) : 529 - 533
[24] Nagabandi A, 2018, IEEE INT CONF ROBOT, P7579
[25] Operational space control: A theoretical and empirical comparison
Nakanishi, Jun
Cory, Rick
Mistry, Michael
Peters, Jan
Schaal, Stefan
[J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2008, 27 (06) : 737 - 757
[26] Analytic Deep Neural Network-Based Robot Control
Nguyen, Huu-Thiet
Cheah, Chien Chern
[J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (04) : 2176 - 2184
[27] Real-Robot Deep Reinforcement Learning: Improving Trajectory Tracking of Flexible-Joint Manipulator with Reference Correction
Pavlichenko, Dmytro
Behnke, Sven
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2671 - 2677
[28] Analyzing Neural Jacobian Methods in Applications of Visual Servoing and Kinematic Control
Przystupa, Michael
Dehghan, Masood
Jagersand, Martin
Mahmood, A. Rupam
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 14276 - 14283
[29] Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations
Raissi, M.
Perdikaris, P.
Karniadakis, G. E.
[J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2019, 378 : 686 - 707
[30] Rueckert E, 2017, 2017 IEEE-RAS 17TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTICS (HUMANOIDS), P811, DOI 10.1109/HUMANOIDS.2017.8246965

← 1 2 3 4 →