A Reinforcement Learning Neural Network for Robotic Manipulator Control

被引：28

作者：

Hu, Yazhou ^{[1
,2
]}

Si, Bailu ^{[1
]}

机构：

[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Beijing, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

来源：

NEURAL COMPUTATION | 2018年 / 30卷 / 07期

关键词：

TIME-DELAY SYSTEMS; ADAPTIVE-CONTROL; TRACKING CONTROL; TARGET DETECTION; DESIGN;

D O I：

10.1162/neco_a_01079

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a neural network model for reinforcement learning to control a robotic manipulator with unknown parameters and dead zones. The model is composed of three networks. The state of the robotic manipulator is predicted by the state network of the model, the action policy is learned by the action network, and the performance index of the action policy is estimated by a critic network. The three networks work together to optimize the performance index based on the reinforcement learning control scheme. The convergence of the learning methods is analyzed. Application of the proposed model on a simulated two-link robotic manipulator demonstrates the effectiveness and the stability of the model.

引用

页码：1983 / 2004

页数：22

共 63 条

[1]

Alessandri A., 2004, AUTOMATICA, V40, P2011, DOI DOI 10.1016/J.AUT0MATICA.2004.05.014

[2]

[Anonymous], 2013, INT C MACHINE LEARNI

[3]

Azeem MM, 2012, COMM COM INF SC, V281, P144

[4]

Baizid K., 2010, P 21 IASTED INT C MO

[5]

Chang-hoi K, 2015, INT C CONTR AUTOMAT, P1154, DOI 10.1109/ICCAS.2015.7364801

[6] Adaptive Consensus Control for a Class of Nonlinear Multiagent Time-Delay Systems Using Neural Networks [J].

Chen, C. L. Philip ;

Wen, Guo-Xing ;

Liu, Yan-Jun ;

Wang, Fei-Yue .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (06) :1217-1226

[7] Fuzzy Neural Network-Based Adaptive Control for a Class of Uncertain Nonlinear Stochastic Systems [J].

Chen, C. L. Philip ;

Liu, Yan-Jun ;

Wen, Guo-Xing .

IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (05) :583-593

[8] Adaptive NN Backstepping Output-Feedback Control for Stochastic Nonlinear Strict-Feedback Systems With Time-Varying Delays [J].

Chen, Weisheng ;

Jiao, Licheng ;

Li, Jing ;

Li, Ruihong .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2010, 40 (03) :939-950

[9] Adaptive Tracking for Periodically Time-Varying and Nonlinearly Parameterized Systems Using Multilayer Neural Networks [J].

Chen, Weisheng ;

Jiao, Licheng .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 2010, 21 (02) :345-351

[10] Adaptive Optimal Control Without Weight Transport [J].

Chinta, Lakshminarayan V. ;

Tweed, Douglas B. .

NEURAL COMPUTATION, 2012, 24 (06) :1487-1518

← 1 2 3 4 5 6 7 →