Learning-based robust optimal tracking controller design for unmanned underwater vehicles with full-state and input constraints

被引:10
作者
Dong, Botao [1 ,2 ]
Shi, Yi [1 ]
Xie, Wei [1 ,2 ]
Chen, Weixing [3 ]
Zhang, Weidong [1 ,4 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automation, Shanghai, Peoples R China
[2] Harbin Engn Univ, Natl Key Lab Sci & Technol Underwater Vehicle, Harbin, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Mech Engn, Shanghai, Peoples R China
[4] Hainan Univ, Sch Informat & Commun Engn, Haikou, Peoples R China
基金
中国国家自然科学基金;
关键词
Full-state and input constraints; Optimal tracking control; Lumped disturbances; Reinforcement learning; MODEL-PREDICTIVE CONTROL; TRAJECTORY TRACKING;
D O I
10.1016/j.oceaneng.2023.113757
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
In this article, the optimal tracking control problem for unmanned underwater vehicles (UUVs) with full -state and input constraints under the presence of external disturbances and internal dynamic uncertainties is addressed. To achieve preassigned state constraints on UUVs, the traditional UUVs model is transformed into an unconstrained one by using two different nonlinear mappings (NMs). Then the robust tracking control problem of traditional UUVs model under position/Euler angles and velocity constraints is transformed to an optimal control problem of the transformed system without any constraints. A learning-based optimal control method is designed to solve the optimal control problem of the transformed system by employing the optimized backstepping (OB) paradigm and reinforcement-learning (RL) technique, achieving uniformly ultimately boundedness (UUB) subject to optimal cost. To deal with lumped disturbances for the velocity control loop, a neural-network (NN) identifier is employed and incorporated into actor-critic architecture, attaining robust tracking performance. Due to the adopted nonquadratic cost function with respect to the control input, the optimal control solution is established in the form of a hyperbolic tangent function to handle the input constraints. Compared with traditional PID method and MPC approach, the proposed controller can improve tracking performance of UUV by 32.04% and 79.64%, respectively.
引用
收藏
页数:13
相关论文
共 43 条