Robotic trajectory tracking control method based on reinforcement learning

被引:0
作者
Liu W. [1 ]
Xing G. [2 ]
Chen H. [1 ]
Sun H. [1 ]
机构
[1] School of Control Science and Engineering, Hebei University of Technology, Tianjin
[2] School of Automation and Electronic Engineering, Qingdao University of Science and Technology, Qingdao
来源
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS | 2018年 / 24卷 / 08期
关键词
Feedforward neural network; PD controller; Reinforcement learning; Robot; Trajectory tracking;
D O I
10.13196/j.cims.2018.08.011
中图分类号
学科分类号
摘要
To improve the working performance of robotic trajectory tracking controller, the robotic trajectory tracking control method based on reinforcement learning was proposed. The basic principle of reinforcement learning was introduced, and then the robot trajectory tracking control strategy based on SARSA was proposed. By using the reinforcement learning, the unknown disturbance factors were compensated and the adaptability to the unknown was improved after the PD control method was applied. The experimental results verified the feasibility and effectiveness of the reinforcement learning method in the trajectory tracking problem of robot arms, and the learning speed of the controller was enhanced. © 2018, Editorial Department of CIMS. All right reserved.
引用
收藏
页码:1996 / 2004
页数:8
相关论文
共 20 条
  • [1] Lou W., Guo X., Adaptive trajectory tracking control using reinforcement learning for quadrotor, International Journal of Advanced Robotic Systems, 13, 1, (2016)
  • [2] Zhang S., Li S., Reinforcement learning based obstacle avoidance for robotic manipulator, Machinery Design & Manufacture, 8, pp. 140-142, (2007)
  • [3] Shah H., Gopal M., Reinforcement learning control of robot manipulators in uncertain environments, Proceedings of the IEEE International Conference on Industrial Technology, pp. 1-6, (2009)
  • [4] Hagen S.T., Krose B., Neural Q-learning, Neural Computing & Applications, 12, 2, pp. 81-88, (2003)
  • [5] Pyeatt L.D., Reinforcement Learning with Decision Tree, Proceedings of the 21st IASTED International Multi-Conference on Applied Informatics, pp. 26-31, (2003)
  • [6] Wang X., Tian X., Cheng Y., Value approximation with leastsquare support vector machines in reinforcement learning system, Journal of Computational and Theoretical Nanoscience, 4, 7-8, pp. 1290-1294, (2007)
  • [7] Lewis F.L., Neural network control of robot manipulators, IEEE Expert, 11, 3, pp. 64-75, (1996)
  • [8] Cui Y., Matsubara T., Sugimoto K., Pneumatic artificial muscle-driven robot control using local update reinforcement learning, Advanced Robotics, 31, 8, pp. 397-412, (2017)
  • [9] Modares H., Ranatunga I., Lewis F.L., Et al., Optimized assistive human-robot interaction using reinforcement learning, IEEE Transactions on Cybernetics, 46, 3, pp. 655-667, (2016)
  • [10] Sutton R.S., Barto A.G., Reinforcement Learning: An Introduction, (1998)