Learning-based robust optimal tracking controller design for unmanned underwater vehicles with full-state and input constraints

Cited by: 10
Authors
Dong, Botao [1, 2]
Shi, Yi [1]
Xie, Wei [1, 2]
Chen, Weixing [3]
Zhang, Weidong [1, 4]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Automation, Shanghai, Peoples R China
[2] Harbin Engn Univ, Natl Key Lab Sci & Technol Underwater Vehicle, Harbin, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Mech Engn, Shanghai, Peoples R China
[4] Hainan Univ, Sch Informat & Commun Engn, Haikou, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Full-state and input constraints; Optimal tracking control; Lumped disturbances; Reinforcement learning; MODEL-PREDICTIVE CONTROL; TRAJECTORY TRACKING;
DOI
10.1016/j.oceaneng.2023.113757
Chinese Library Classification (CLC) number
U6 [Waterway Transportation]; P75 [Ocean Engineering];
Discipline classification code
0814 ; 081505 ; 0824 ; 082401 ;
Abstract
This article addresses the optimal tracking control problem for unmanned underwater vehicles (UUVs) with full-state and input constraints in the presence of external disturbances and internal dynamic uncertainties. To enforce preassigned state constraints on UUVs, the traditional UUV model is transformed into an unconstrained one by using two different nonlinear mappings (NMs). The robust tracking control problem of the traditional UUV model under position/Euler angle and velocity constraints is thereby transformed into an optimal control problem of the transformed system without any constraints. A learning-based optimal control method is designed to solve the optimal control problem of the transformed system by employing the optimized backstepping (OB) paradigm and the reinforcement learning (RL) technique, achieving uniform ultimate boundedness (UUB) subject to optimal cost. To deal with lumped disturbances in the velocity control loop, a neural-network (NN) identifier is employed and incorporated into the actor-critic architecture, attaining robust tracking performance. Owing to the adopted nonquadratic cost function with respect to the control input, the optimal control solution is established in the form of a hyperbolic tangent function to handle the input constraints. Compared with the traditional PID method and the MPC approach, the proposed controller can improve the tracking performance of the UUV by 32.04% and 79.64%, respectively.
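The abstract does not give the paper's actual mappings or control law, so the following is only a minimal sketch of the two general ideas it names. It assumes a logarithmic nonlinear mapping of the kind commonly used for state constraints, and the hyperbolic tangent control form that typically results from a nonquadratic input cost, with a diagonal input gain and diagonal weights. All function names, parameters, and numeric values are hypothetical and are not taken from the paper.

```python
import math


def log_mapping(x, lower, upper):
    """Map a constrained state x in (-lower, upper) onto the whole real line.

    Assumed form: s = ln((lower + x) / (upper - x)). The paper's own
    nonlinear mappings are not given in the abstract.
    """
    if not -lower < x < upper:
        raise ValueError("state is outside the open constraint interval")
    return math.log((lower + x) / (upper - x))


def inverse_log_mapping(s, lower, upper):
    """Recover x from s; the result stays within [-lower, upper] for any real s."""
    # Algebraically equal to (upper*e^s - lower) / (1 + e^s), written with
    # tanh so that large |s| cannot overflow.
    sigma = 0.5 * (1.0 + math.tanh(0.5 * s))
    return -lower + (lower + upper) * sigma


def saturated_control(grad_v, g, r, lam):
    """Per-channel control u_i = -lam * tanh(g_i * dV_i / (2 * lam * r_i)).

    grad_v: gradient of the (approximated) value function per channel
    g:      diagonal input gain (assumption: decoupled channels)
    r:      diagonal input weights of the nonquadratic cost
    lam:    symmetric input bound, so |u_i| <= lam by construction
    """
    return [-lam * math.tanh(gi * dv / (2.0 * lam * ri))
            for dv, gi, ri in zip(grad_v, g, r)]


if __name__ == "__main__":
    s = log_mapping(0.3, lower=1.0, upper=2.0)
    print("mapped state:", s)
    print("recovered state:", inverse_log_mapping(s, lower=1.0, upper=2.0))
    print("bounded inputs:", saturated_control([5.0, -50.0, 0.0],
                                               [1.0, 1.0, 1.0],
                                               [1.0, 1.0, 1.0], lam=2.0))
```

Because the hyperbolic tangent is bounded by one in magnitude, the input bound holds for any value-function gradient, which is why no explicit clipping step appears in the sketch.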
Pages: 13
Related papers
43 in total
  • [31] Reinforcement Learning-Based Optimal Tracking Control for Levitation System of Maglev Vehicle With Input Time Delay
    Sun, Yougang
    Xu, Junqi
    Chen, Chen
    Hu, Wei
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [32] Safe Consensus Tracking With Guaranteed Full State and Input Constraints: A Control Barrier Function-Based Approach
    Fu, Junjie
    Wen, Guanghui
    Yu, Xinghuo
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (12) : 8075 - 8081
  • [33] Event-Based Nonsingular Fixed-Time Tracking Control of an Uncertain Manipulator System Subject to Full-State Static Constraints
    Zhang, Zhongcai
    Gao, Yang
    Sun, Wei
    Wu, Yuqiang
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (07) : 3980 - 3993
  • [34] Safe model-based reinforcement learning for nonlinear optimal control with state and input constraints
    Kim, Yeonsoo
    Kim, Jong Woo
    AICHE JOURNAL, 2022, 68 (05)
  • [35] Reinforcement Learning-Based Constrained Optimal Control of Strict-feedback Nonlinear Systems: Application to Autonomous Underwater Vehicles
    Farzanegan, Behzad
    Jagannathan, S.
    2024 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS, CCTA 2024, 2024, : 651 - 656
  • [36] Continuous-Discrete Observation-Based Robust Tracking Control of Underwater Vehicles: Design, Stability Analysis, and Experiments
    Tijjani, Auwal Shehu
    Chemori, Ahmed
    Ali, Sofiane Ahmed
    Creuze, Vincent
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2023, 31 (04) : 1477 - 1492
  • [37] Design of estimator-based nonlinear dynamic inversion controller and nonlinear regulator for robust trajectory tracking with aerial vehicles
    Homayouni Amlashi A.
    Mojed Gharamaleki R.
    Hamidi Nejad M.H.
    Mirzaei M.
    International Journal of Dynamics and Control, 2018, 6 (2) : 707 - 725
  • [38] Online Optimization-Based Time-Optimal Adaptive Robust Control of Linear Motors With Input and State Constraints
    Liu, Yingqiang
    Chen, Zheng
    Yao, Bin
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (04) : 3157 - 3165
  • [39] Extended State Observer-Based Controller With Model Predictive Governor for 3-D Trajectory Tracking of Underactuated Underwater Vehicles
    Kong, Shihan
    Sun, Jinlin
    Qiu, Changlin
    Wu, Zhengxing
    Yu, Junzhi
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (09) : 6114 - 6124
  • [40] Observed-based fast fixed-time fault-tolerant trajectory tracking control with prescribed performance for unmanned underwater vehicles under constraints
    Liang, Hongtao
    Yu, Junzhi
    Li, Huiping
    OCEAN ENGINEERING, 2025, 320