Reinforcement learning-based optimal trajectory tracking control of surface vessels under input saturations

Cited by: 15

Authors:

Wei, Ziping [1]

Du, Jialu [1,2]

Affiliations:

[1] Dalian Maritime Univ, Sch Marine Elect Engn, Dalian, Peoples R China

[2] Dalian Maritime Univ, Sch Marine Engn, Dalian 116026, Liaoning, Peoples R China

Source:

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL | 2023 / Vol. 33 / No. 06

Funding:

National Natural Science Foundation of China;

Keywords:

input saturations; reinforcement learning; surface vessels; trajectory tracking; unknown dynamics; WAVE ENERGY CONVERTERS; ADAPTIVE NN CONTROL; NONLINEAR-SYSTEMS; CONTROL ALGORITHM; UNKNOWN DYNAMICS; CONTROL DESIGN; ROBUST-CONTROL;

DOI:

10.1002/rnc.6597

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

This paper develops a reinforcement learning (RL)-based optimal trajectory tracking control scheme for surface vessels subject to unknown dynamics, unknown disturbances, and input saturations. The control scheme combines optimal control theory, adaptive neural networks (NNs), and the RL method in a unified actor-critic NN framework. A hyperbolic-type penalty function of the control input is designed to deal with the input saturations of surface vessels. An actor-critic NN-based RL mechanism is established to learn the optimal trajectory tracking control law without knowledge of the surface vessel dynamics and disturbances, with the NN weights tuned online according to the devised tuning laws. Theoretical analysis and simulation results show that the proposed RL-based optimal trajectory tracking control scheme ensures that surface vessels track the desired trajectory, while guaranteeing the boundedness of all signals in the closed-loop optimal trajectory tracking control system.
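The abstract does not give the exact form of the hyperbolic-type penalty function. As a hedged illustration only, the sketch below uses the inverse-hyperbolic-tangent penalty that is common in the constrained-input optimal control literature, U(u) = 2 r ∫_0^u u_max atanh(v / u_max) dv, evaluated in closed form for a scalar input. The function names, the weight `r`, and the bound `u_max` are assumptions made for this example and are not taken from the paper.

```python
import math


def hyperbolic_penalty(u, u_max, r=1.0):
    """Nonquadratic input penalty (assumed form, not the paper's):
    U(u) = 2*r * integral_0^u u_max * atanh(v / u_max) dv
         = 2*r*u_max*u*atanh(u/u_max) + r*u_max**2 * ln(1 - (u/u_max)**2).
    Defined only for |u| < u_max; it grows steeply as |u| approaches u_max.
    """
    if abs(u) >= u_max:
        raise ValueError("penalty is defined only for |u| < u_max")
    s = u / u_max
    return 2.0 * r * u_max * u * math.atanh(s) + r * u_max ** 2 * math.log(1.0 - s * s)


def saturated_control(v, u_max):
    """Map an unconstrained signal v to a bounded command via tanh.
    With a penalty of the form above, the minimizing control takes this
    tanh shape, so the command never exceeds u_max in magnitude."""
    return u_max * math.tanh(v / u_max)


if __name__ == "__main__":
    for u in (0.0, 0.5, 1.5, 1.99):
        print(u, hyperbolic_penalty(u, u_max=2.0))
    print(saturated_control(10.0, u_max=2.0))
```

For small inputs this penalty behaves like the usual quadratic cost r*u**2, while near the actuator limit it grows much faster, which is what discourages a learned control law from demanding more than the actuators can deliver.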


Pages: 3807-3825

Page count: 19

50 items in total

[41] Reinforcement Learning-Based Optimal Battery Control Under Cycle-Based Degradation Cost [J].

Kwon, Kyung-bin ;

Zhu, Hao .

IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (06) :4909-4917

[42] Reinforcement learning-based tracking control for AUVs subject to disturbances [J].

Wang, Guangcang ;

Zhang, Dianfeng ;

Wu, Zhaojing .

2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, :2222-2227

[43] Lifelong Learning-Based Optimal Trajectory Tracking Control of Constrained Nonlinear Affine Systems Using Deep Neural Networks [J].

Ganie, Irfan ;

Jagannathan, Sarangapani .

IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (12) :7133-7146

[44] Adaptive Output Feedback Tracking Control of Surface Vessels Under Input Saturation [J].

Li, Jian ;

Du, Jialu ;

Hu, Xin .

PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, :1288-1293

[45] Reinforcement learning-based prescribed finite-time optimal tracking control for a vehicle system regardless of initial position [J].

Liu, Ying ;

Li, Xiaohua ;

Liu, Hui .

PROCEEDINGS OF THE 36TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC 2024, 2024, :3453-3458

[46] Off-policy integral reinforcement learning-based optimal tracking control for a class of nonzero-sum game systems with unknown dynamics [J].

Zhao, Jin-Gang ;

Chen, Fang-Fang .

OPTIMAL CONTROL APPLICATIONS & METHODS, 2022, 43 (06) :1623-1644

[47] Supervised reinforcement learning based trajectory tracking control for autonomous vehicles [J].

Mihaly, Andras ;

Van Tan Vu ;

Trong Tu Do ;

Gaspar, Peter .

IFAC PAPERSONLINE, 2024, 58 (10) :140-145

[48] USV Trajectory Tracking Control Based on Receding Horizon Reinforcement Learning [J].

Wen, Yinghan ;

Chen, Yuepeng ;

Guo, Xuan .

SENSORS, 2024, 24 (09)

[49] Fixed-time sliding mode trajectory tracking control for marine surface vessels under mismatched conditions and input saturation [J].

Zhang, Jingqi ;

Yu, Shuanghe ;

Yan, Yan ;

Zhao, Ying .

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (09) :6142-6164

[50] Trajectory Tracking Control for Underactuated Surface Vessels Based on Nonlinear Model Predictive Control [J].

Liu, Chenguang ;

Zheng, Huarong ;

Negenborn, Rudy R. ;

Chu, Xiumin ;

Wang, Le .

COMPUTATIONAL LOGISTICS (ICCL 2015), 2015, 9335 :166-180
