Reinforcement learning-based optimal trajectory tracking control of surface vessels under input saturations

Cited by: 16
Authors
Wei, Ziping [1]
Du, Jialu [1, 2]
Affiliations
[1] Dalian Maritime Univ, Sch Marine Elect Engn, Dalian, Peoples R China
[2] Dalian Maritime Univ, Sch Marine Engn, Dalian 116026, Liaoning, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
input saturations; reinforcement learning; surface vessels; trajectory tracking; unknown dynamics; WAVE ENERGY CONVERTERS; ADAPTIVE NN CONTROL; NONLINEAR-SYSTEMS; CONTROL ALGORITHM; UNKNOWN DYNAMICS; CONTROL DESIGN; ROBUST-CONTROL;
DOI
10.1002/rnc.6597
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper develops a reinforcement learning (RL)-based optimal trajectory tracking control scheme for surface vessels with unknown dynamics, unknown disturbances, and input saturations. The control scheme is designed by combining optimal control theory, adaptive neural networks (NNs), and the RL method in a unified actor-critic NN framework. A hyperbolic-type penalty function of the control input is designed to handle the input saturations of surface vessels. An actor-critic NN-based RL mechanism is established to learn the optimal trajectory tracking control law without knowledge of the surface vessel dynamics and disturbances, with the NN weights tuned online according to devised tuning laws. Theoretical analysis and simulation results show that the proposed RL-based optimal trajectory tracking control scheme ensures that surface vessels track the desired trajectory while guaranteeing the boundedness of all signals in the closed-loop optimal trajectory tracking control system.
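The abstract does not state the form of the hyperbolic-type penalty function. As an illustration only, the sketch below uses a form that is common in the saturated optimal-control literature (for example Abu-Khalaf and Lewis, Automatica, 2005): for a scalar input with bound u_max and weight r, W(u) = 2 r u_max * integral from 0 to u of atanh(v / u_max) dv. Minimizing W(u) plus a linear costate term then yields a tanh-shaped control that cannot exceed the bound. The function names, the scalar setting, and the parameters `u_max`, `r`, and `costate_term` are assumptions made for this example and are not taken from the paper.

```python
import math


def saturation_penalty(u: float, u_max: float, r: float = 1.0) -> float:
    """Closed form of W(u) = 2*r*u_max * integral_0^u atanh(v/u_max) dv.

    Valid for |u| < u_max. Assumed illustrative form, not the paper's.
    """
    s = u / u_max
    return 2.0 * r * u_max * u * math.atanh(s) + r * u_max**2 * math.log1p(-s * s)


def saturated_control(costate_term: float, u_max: float, r: float = 1.0) -> float:
    """Minimizer of W(u) + costate_term * u.

    Setting dW/du + costate_term = 0 gives u = -u_max * tanh(costate_term / (2*r*u_max)),
    so the result always lies in [-u_max, u_max].
    """
    return -u_max * math.tanh(costate_term / (2.0 * r * u_max))
```

For small inputs the penalty behaves like the quadratic cost r * u**2, and it grows steeply as |u| approaches u_max, which is what discourages saturating commands. In an actor-critic setting, `costate_term` would typically be built from the critic's estimate of the value-function gradient; that wiring is omitted here.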
Pages: 3807-3825
Page count: 19
Related papers
62 in total
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]   Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation [J].
Beard, RW ;
Saridis, GN ;
Wen, JT .
AUTOMATICA, 1997, 33 (12) :2159-2177
[3]   Adaptive fuzzy tracking control for underactuated surface vessels with unmodeled dynamics and input saturation [J].
Deng, Yingjie ;
Zhang, Xianku ;
Im, Namkyun ;
Zhang, Guoqing ;
Zhang, Qiang .
ISA TRANSACTIONS, 2020, 103 :52-62
[4]   Adaptive Robust Nonlinear Control Design for Course Tracking of Ships Subject to External Disturbances and Input Saturation [J].
Du, Jialu ;
Hu, Xin ;
Sun, Yuqing .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (01) :193-202
[5]   Robust dynamic positioning of ships with disturbances under input saturation [J].
Du, Jialu ;
Hu, Xin ;
Krstic, Miroslav ;
Sun, Yuqing .
AUTOMATICA, 2016, 73 :207-214
[6]   UNIVERSAL APPROXIMATION OF AN UNKNOWN MAPPING AND ITS DERIVATIVES USING MULTILAYER FEEDFORWARD NETWORKS [J].
HORNIK, K ;
STINCHCOMBE, M ;
WHITE, H .
NEURAL NETWORKS, 1990, 3 (05) :551-560
[7]   Robust adaptive NN control of dynamically positioned vessels under input constraints [J].
Hu, Xin ;
Du, Jialu ;
Zhu, Guibing ;
Sun, Yuqing .
NEUROCOMPUTING, 2018, 318 :201-212
[8]   Robust nonlinear control design for dynamic positioning of marine vessels with thruster system dynamics [J].
Hu, Xin ;
Du, Jialu .
NONLINEAR DYNAMICS, 2018, 94 (01) :365-376
[9]   Attention-Based Meta-Reinforcement Learning for Tracking Control of AUV With Time-Varying Dynamics [J].
Jiang, Peng ;
Song, Shiji ;
Huang, Gao .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) :6388-6401
[10]   Actor-Critic-Based Optimal Tracking for Partially Unknown Nonlinear Discrete-Time Systems [J].
Kiumarsi, Bahare ;
Lewis, Frank L. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (01) :140-151