Learning-based robust optimal tracking controller design for unmanned underwater vehicles with full-state and input constraints

被引：10

作者：

Dong, Botao ^{[1
,2
]}

Shi, Yi ^{[1
]}

Xie, Wei ^{[1
,2
]}

Chen, Weixing ^{[3
]}

Zhang, Weidong ^{[1
,4
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Automation, Shanghai, Peoples R China

[2] Harbin Engn Univ, Natl Key Lab Sci & Technol Underwater Vehicle, Harbin, Peoples R China

[3] Shanghai Jiao Tong Univ, Sch Mech Engn, Shanghai, Peoples R China

[4] Hainan Univ, Sch Informat & Commun Engn, Haikou, Peoples R China

来源：

OCEAN ENGINEERING | 2023年 / 271卷

基金：

中国国家自然科学基金;

关键词：

Full-state and input constraints; Optimal tracking control; Lumped disturbances; Reinforcement learning; MODEL-PREDICTIVE CONTROL; TRAJECTORY TRACKING;

D O I：

10.1016/j.oceaneng.2023.113757

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

In this article, the optimal tracking control problem for unmanned underwater vehicles (UUVs) with full -state and input constraints under the presence of external disturbances and internal dynamic uncertainties is addressed. To achieve preassigned state constraints on UUVs, the traditional UUVs model is transformed into an unconstrained one by using two different nonlinear mappings (NMs). Then the robust tracking control problem of traditional UUVs model under position/Euler angles and velocity constraints is transformed to an optimal control problem of the transformed system without any constraints. A learning-based optimal control method is designed to solve the optimal control problem of the transformed system by employing the optimized backstepping (OB) paradigm and reinforcement-learning (RL) technique, achieving uniformly ultimately boundedness (UUB) subject to optimal cost. To deal with lumped disturbances for the velocity control loop, a neural-network (NN) identifier is employed and incorporated into actor-critic architecture, attaining robust tracking performance. Due to the adopted nonquadratic cost function with respect to the control input, the optimal control solution is established in the form of a hyperbolic tangent function to handle the input constraints. Compared with traditional PID method and MPC approach, the proposed controller can improve tracking performance of UUV by 32.04% and 79.64%, respectively.

引用

页数：13

共 43 条

[41] Adaptive learning-based optimal tracking control system design and analysis of a disturbed nonlinear hypersonic vehicle model
An, Kai
Wang, Zhenguo
Huang, Wei
SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2024, 67 (06) : 1893 - 1906
[42] A novel model-free robust saturated reinforcement learning-based controller for quadrotors guaranteeing prescribed transient and steady state performance
Elhaki, Omid
Shojaei, Khoshnam
AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 119
[43] Optimal tracking control based on reinforcement learning value iteration algorithm for time-delayed nonlinear systems with external disturbances and input constraints
Mohammadi, Mehdi
Arefi, Mohammad Mehdi
Setoodeh, Peyman
Kaynak, Okyay
INFORMATION SCIENCES, 2021, 554 : 84 - 98

← 1 2 3 4 5 →