Trajectory-Tracking Control of Robotic Systems via Deep Reinforcement Learning

被引：0

作者：

Zhang, Shansi ^{[1
]}

Sun, Chao ^{[1
]}

Feng, Zhi ^{[1
]}

Hu, Guoqiang ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore

来源：

PROCEEDINGS OF THE IEEE 2019 9TH INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) ROBOTICS, AUTOMATION AND MECHATRONICS (RAM) (CIS & RAM 2019) | 2019年

关键词：

Trajectory tracking; DDPG; distributed framework;

D O I：

10.1109/cis-ram47153.2019.9095802

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper studies the trajectory tracking problems for a robotic manipulator and a mobile robot by using deep reinforcement learning based methods. A model-free deep reinforcement learning method based on Deep Deterministic Policy Gradient (DDPG) is designed for training. The priority replay memory is adopted to sample more significant transitions at each update. A distributed framework with multiple workers is proposed. Synchronous workers generate transitions and compute gradients for the global network, and collecting workers explore the environment with different policies and exploration noises. During the training, we adopt random reference slate initialization to solve the exploration problem, which can make the robots learn from the reference trajectory effectively. Numerical simulations are provided to demonstrate the effectiveness and efficiency of the proposed methods. It can be seen from the simulation results that the agent trained by the proposed distributed DDPG could learn faster and achieve smaller tracking errors than DDPG.

引用

页码：386 / 391

页数：6

共 16 条

[1]

Alshamasin M.S., 2009, EUROPEAN J SCI RES, V37, P388

[2]

Andrychowicz M., 2018, ARXIV180800177

[3]

Hessel M, 2018, AAAI CONF ARTIF INTE, P3215

[4] Optimising SME Potential in Modern Healthcare Systems: Challenges, Opportunities and Policy Recommendations [J].

Horgan, Denis ;

van Kranen, Henk J. ;

Morre, Servaas A. .

PUBLIC HEALTH GENOMICS, 2018, 21 (1-2) :1-17

[5]

Lillicrap TP, 2015, ARXIV150902971

[6]

Mnih V, 2016, PR MACH LEARN RES, V48

[7] Human-level control through deep reinforcement learning [J].

Mnih, Volodymyr ;

Kavukcuoglu, Koray ;

Silver, David ;

Rusu, Andrei A. ;

Veness, Joel ;

Bellemare, Marc G. ;

Graves, Alex ;

Riedmiller, Martin ;

Fidjeland, Andreas K. ;

Ostrovski, Georg ;

Petersen, Stig ;

Beattie, Charles ;

Sadik, Amir ;

Antonoglou, Ioannis ;

King, Helen ;

Kumaran, Dharshan ;

Wierstra, Daan ;

Legg, Shane ;

Hassabis, Demis .

NATURE, 2015, 518 (7540) :529-533

[8]

Mnih Volodymyr, 2013, CORR

[9]

Nair A., 2015, Massively parallel methods for deep reinforcement learning

[10]

Peng XB, 2021, ACM T GRAPHIC, V40, DOI [10.1145/3450626.3459670, 10.1145/3197517.3201311]

← 1 2 →