Learning-based robust optimal tracking controller design for unmanned underwater vehicles with full-state and input constraints

被引：10

作者：

Dong, Botao ^{[1
,2
]}

Shi, Yi ^{[1
]}

Xie, Wei ^{[1
,2
]}

Chen, Weixing ^{[3
]}

Zhang, Weidong ^{[1
,4
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Automation, Shanghai, Peoples R China

[2] Harbin Engn Univ, Natl Key Lab Sci & Technol Underwater Vehicle, Harbin, Peoples R China

[3] Shanghai Jiao Tong Univ, Sch Mech Engn, Shanghai, Peoples R China

[4] Hainan Univ, Sch Informat & Commun Engn, Haikou, Peoples R China

来源：

OCEAN ENGINEERING | 2023年 / 271卷

基金：

中国国家自然科学基金;

关键词：

Full-state and input constraints; Optimal tracking control; Lumped disturbances; Reinforcement learning; MODEL-PREDICTIVE CONTROL; TRAJECTORY TRACKING;

D O I：

10.1016/j.oceaneng.2023.113757

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

In this article, the optimal tracking control problem for unmanned underwater vehicles (UUVs) with full -state and input constraints under the presence of external disturbances and internal dynamic uncertainties is addressed. To achieve preassigned state constraints on UUVs, the traditional UUVs model is transformed into an unconstrained one by using two different nonlinear mappings (NMs). Then the robust tracking control problem of traditional UUVs model under position/Euler angles and velocity constraints is transformed to an optimal control problem of the transformed system without any constraints. A learning-based optimal control method is designed to solve the optimal control problem of the transformed system by employing the optimized backstepping (OB) paradigm and reinforcement-learning (RL) technique, achieving uniformly ultimately boundedness (UUB) subject to optimal cost. To deal with lumped disturbances for the velocity control loop, a neural-network (NN) identifier is employed and incorporated into actor-critic architecture, attaining robust tracking performance. Due to the adopted nonquadratic cost function with respect to the control input, the optimal control solution is established in the form of a hyperbolic tangent function to handle the input constraints. Compared with traditional PID method and MPC approach, the proposed controller can improve tracking performance of UUV by 32.04% and 79.64%, respectively.

引用

页数：13

共 43 条

[31] Reinforcement Learning-Based Optimal Tracking Control for Levitation System of Maglev Vehicle With Input Time Delay
Sun, Yougang
Xu, Junqi
Chen, Chen
Hu, Wei
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[32] Safe Consensus Tracking With Guaranteed Full State and Input Constraints: A Control Barrier Function-Based Approach
Fu, Junjie
Wen, Guanghui
Yu, Xinghuo
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (12) : 8075 - 8081
[33] Event-Based Nonsingular Fixed-Time Tracking Control of an Uncertain Manipulator System Subject to Full-State Static Constraints
Zhang, Zhongcai
Gao, Yang
Sun, Wei
Wu, Yuqiang
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (07) : 3980 - 3993
[34] Safe model-based reinforcement learning for nonlinear optimal control with state and input constraints
Kim, Yeonsoo
Kim, Jong Woo
AICHE JOURNAL, 2022, 68 (05)
[35] Reinforcement Learning-Based Constrained Optimal Control of Strict-feedback Nonlinear Systems: Application to Autonomous Underwater Vehicles
Farzanegan, Behzad
Jagannathan, S.
2024 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS, CCTA 2024, 2024, : 651 - 656
[36] Continuous-Discrete Observation-Based Robust Tracking Control of Underwater Vehicles: Design, Stability Analysis, and Experiments
Tijjani, Auwal Shehu
Chemori, Ahmed
Ali, Sofiane Ahmed
Creuze, Vincent
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2023, 31 (04) : 1477 - 1492
[37] Design of estimator-based nonlinear dynamic inversion controller and nonlinear regulator for robust trajectory tracking with aerial vehicles
Homayouni Amlashi A.
Mojed Gharamaleki R.
Hamidi Nejad M.H.
Mirzaei M.
International Journal of Dynamics and Control, 2018, 6 (2) : 707 - 725
[38] Online Optimization-Based Time-Optimal Adaptive Robust Control of Linear Motors With Input and State Constraints
Liu, Yingqiang
Chen, Zheng
Yao, Bin
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (04) : 3157 - 3165
[39] Extended State Observer-Based Controller With Model Predictive Governor for 3-D Trajectory Tracking of Underactuated Underwater Vehicles
Kong, Shihan
Sun, Jinlin
Qiu, Changlin
Wu, Zhengxing
Yu, Junzhi
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (09) : 6114 - 6124
[40] Observed-based fast fixed-time fault-tolerant trajectory tracking control with prescribed performance for unmanned underwater vehicles under constraints
Liang, Hongtao
Yu, Junzhi
Li, Huiping
OCEAN ENGINEERING, 2025, 320

← 1 2 3 4 5 →