Reinforcement learning-based optimal trajectory tracking control of surface vessels under input saturations

被引:16
作者
Wei, Ziping [1 ]
Du, Jialu [1 ,2 ]
机构
[1] Dalian Maritime Univ, Sch Marine Elect Engn, Dalian, Peoples R China
[2] Dalian Maritime Univ, Sch Marine Engn, Dalian 116026, Liaoning, Peoples R China
基金
中国国家自然科学基金;
关键词
input saturations; reinforcement learning; surface vessels; trajectory tracking; unknown dynamics; WAVE ENERGY CONVERTERS; ADAPTIVE NN CONTROL; NONLINEAR-SYSTEMS; CONTROL ALGORITHM; UNKNOWN DYNAMICS; CONTROL DESIGN; ROBUST-CONTROL;
D O I
10.1002/rnc.6597
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper develops a reinforcement learning (RL)-based optimal trajectory tracking control scheme of surface vessels with unknown dynamics, unknown disturbances, and input saturations of surface vessels. The control scheme is designed by combining the optimal control theory, adaptive neural networks, and the RL method in a unified actor-critic NN framework. A hyperbolic-type penalty function of the control input is designed so as to deal with the input saturations of surface vessels. An actor-critic NN-based RL mechanism is established to learn the optimal trajectory tracking control law without the knowledge of the surface vessel dynamics and disturbances, where NN weights are tuned online on the basis of devised tuning laws. Theoretical analysis and simulation results prove that the proposed RL-based optimal trajectory tracking control scheme can ensure surface vessels track the desired trajectory, while guaranteeing the boundedness of all signals in the surface vessel optimal trajectory tracking closed-loop control system.
引用
收藏
页码:3807 / 3825
页数:19
相关论文
共 50 条
[21]   Inverse optimal design for trajectory tracking with input saturations via adaptive recurrent neural control [J].
Ricalde, LJ ;
Sanchez, EN .
42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, :6200-6205
[22]   Reinforcement learning-based NMPC for tracking control of ASVs: Theory and experiments [J].
Martinsen, Andreas B. ;
Lekkas, Anastasios M. ;
Gros, Sebastien .
CONTROL ENGINEERING PRACTICE, 2022, 120
[23]   Optimal trajectory tracking control based on reinforcement learning for the deployment process of space tether system [J].
Feng, Yiting ;
Wang, Changqing ;
Li, Aijun .
IFAC PAPERSONLINE, 2020, 53 (01) :679-684
[24]   Robotic trajectory tracking control method based on reinforcement learning [J].
Liu W. ;
Xing G. ;
Chen H. ;
Sun H. .
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2018, 24 (08) :1996-2004
[25]   Reinforcement learning-based robust optimal tracking control for disturbed nonlinear systems [J].
Zhong-Xin Fan ;
Lintao Tang ;
Shihua Li ;
Rongjie Liu .
Neural Computing and Applications, 2023, 35 :23987-23996
[26]   Reinforcement learning-based robust optimal tracking control for disturbed nonlinear systems [J].
Fan, Zhong-Xin ;
Tang, Lintao ;
Li, Shihua ;
Liu, Rongjie .
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (33) :23987-23996
[27]   Reinforcement Learning-Based Tracking Control for a Three Mecanum Wheeled Mobile Robot [J].
Zhang, Dianfeng ;
Wang, Guangcang ;
Wu, Zhaojing .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) :1445-1452
[28]   Reinforcement learning-based optimal control of uncertain nonlinear systems [J].
Garcia, Miguel ;
Dong, Wenjie .
INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (12) :2839-2850
[29]   Deep reinforcement learning-based centralized control for trajectory tracking of distributed drive electric vehicles [J].
Zhang, Zhiyong ;
Zhang, Jiarui ;
Huang, Caixia ;
Wu, You ;
Zhang, Jie .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025,
[30]   Reinforcement Learning-Based Optimal Fault-Tolerant Tracking Control of Industrial Processes [J].
Wang, Limin ;
Li, Xueyu ;
Zhang, Ridong ;
Gao, Furong .
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2023, 62 (39) :16014-16024