A Reinforcement Learning Neural Network for Robotic Manipulator Control

被引:28
作者
Hu, Yazhou [1 ,2 ]
Si, Bailu [1 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
TIME-DELAY SYSTEMS; ADAPTIVE-CONTROL; TRACKING CONTROL; TARGET DETECTION; DESIGN;
D O I
10.1162/neco_a_01079
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a neural network model for reinforcement learning to control a robotic manipulator with unknown parameters and dead zones. The model is composed of three networks. The state of the robotic manipulator is predicted by the state network of the model, the action policy is learned by the action network, and the performance index of the action policy is estimated by a critic network. The three networks work together to optimize the performance index based on the reinforcement learning control scheme. The convergence of the learning methods is analyzed. Application of the proposed model on a simulated two-link robotic manipulator demonstrates the effectiveness and the stability of the model.
引用
收藏
页码:1983 / 2004
页数:22
相关论文
共 63 条
[1]  
Alessandri A., 2004, AUTOMATICA, V40, P2011, DOI DOI 10.1016/J.AUT0MATICA.2004.05.014
[2]  
[Anonymous], 2013, INT C MACHINE LEARNI
[3]  
Azeem MM, 2012, COMM COM INF SC, V281, P144
[4]  
Baizid K., 2010, P 21 IASTED INT C MO
[5]  
Chang-hoi K, 2015, INT C CONTR AUTOMAT, P1154, DOI 10.1109/ICCAS.2015.7364801
[6]   Adaptive Consensus Control for a Class of Nonlinear Multiagent Time-Delay Systems Using Neural Networks [J].
Chen, C. L. Philip ;
Wen, Guo-Xing ;
Liu, Yan-Jun ;
Wang, Fei-Yue .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (06) :1217-1226
[7]   Fuzzy Neural Network-Based Adaptive Control for a Class of Uncertain Nonlinear Stochastic Systems [J].
Chen, C. L. Philip ;
Liu, Yan-Jun ;
Wen, Guo-Xing .
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (05) :583-593
[8]   Adaptive NN Backstepping Output-Feedback Control for Stochastic Nonlinear Strict-Feedback Systems With Time-Varying Delays [J].
Chen, Weisheng ;
Jiao, Licheng ;
Li, Jing ;
Li, Ruihong .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2010, 40 (03) :939-950
[9]   Adaptive Tracking for Periodically Time-Varying and Nonlinearly Parameterized Systems Using Multilayer Neural Networks [J].
Chen, Weisheng ;
Jiao, Licheng .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2010, 21 (02) :345-351
[10]   Adaptive Optimal Control Without Weight Transport [J].
Chinta, Lakshminarayan V. ;
Tweed, Douglas B. .
NEURAL COMPUTATION, 2012, 24 (06) :1487-1518