Trajectory-Tracking Control of Robotic Systems via Deep Reinforcement Learning

被引:0
作者
Zhang, Shansi [1 ]
Sun, Chao [1 ]
Feng, Zhi [1 ]
Hu, Guoqiang [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
来源
PROCEEDINGS OF THE IEEE 2019 9TH INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) ROBOTICS, AUTOMATION AND MECHATRONICS (RAM) (CIS & RAM 2019) | 2019年
关键词
Trajectory tracking; DDPG; distributed framework;
D O I
10.1109/cis-ram47153.2019.9095802
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the trajectory tracking problems for a robotic manipulator and a mobile robot by using deep reinforcement learning based methods. A model-free deep reinforcement learning method based on Deep Deterministic Policy Gradient (DDPG) is designed for training. The priority replay memory is adopted to sample more significant transitions at each update. A distributed framework with multiple workers is proposed. Synchronous workers generate transitions and compute gradients for the global network, and collecting workers explore the environment with different policies and exploration noises. During the training, we adopt random reference slate initialization to solve the exploration problem, which can make the robots learn from the reference trajectory effectively. Numerical simulations are provided to demonstrate the effectiveness and efficiency of the proposed methods. It can be seen from the simulation results that the agent trained by the proposed distributed DDPG could learn faster and achieve smaller tracking errors than DDPG.
引用
收藏
页码:386 / 391
页数:6
相关论文
共 16 条
[1]  
Alshamasin M.S., 2009, EUROPEAN J SCI RES, V37, P388
[2]  
Andrychowicz M., 2018, ARXIV180800177
[3]  
Hessel M, 2018, AAAI CONF ARTIF INTE, P3215
[4]   Optimising SME Potential in Modern Healthcare Systems: Challenges, Opportunities and Policy Recommendations [J].
Horgan, Denis ;
van Kranen, Henk J. ;
Morre, Servaas A. .
PUBLIC HEALTH GENOMICS, 2018, 21 (1-2) :1-17
[5]  
Lillicrap TP, 2015, ARXIV150902971
[6]  
Mnih V, 2016, PR MACH LEARN RES, V48
[7]   Human-level control through deep reinforcement learning [J].
Mnih, Volodymyr ;
Kavukcuoglu, Koray ;
Silver, David ;
Rusu, Andrei A. ;
Veness, Joel ;
Bellemare, Marc G. ;
Graves, Alex ;
Riedmiller, Martin ;
Fidjeland, Andreas K. ;
Ostrovski, Georg ;
Petersen, Stig ;
Beattie, Charles ;
Sadik, Amir ;
Antonoglou, Ioannis ;
King, Helen ;
Kumaran, Dharshan ;
Wierstra, Daan ;
Legg, Shane ;
Hassabis, Demis .
NATURE, 2015, 518 (7540) :529-533
[8]  
Mnih Volodymyr, 2013, CORR
[9]  
Nair A., 2015, Massively parallel methods for deep reinforcement learning
[10]  
Peng XB, 2021, ACM T GRAPHIC, V40, DOI [10.1145/3450626.3459670, 10.1145/3197517.3201311]