Distributed optimal formation tracking control based on reinforcement learning for underactuated AUVs with asymmetric constraints

被引:9
|
作者
Wang, Zhengkun [1 ]
Zhang, Lijun [1 ,2 ]
机构
[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Harbin 150001, Peoples R China
[2] Northwestern Polytech Univ, Sch Marine Technol, Xian 710072, Peoples R China
关键词
Distributed optimal; Backstepping; Reinforcement learning; Critic-actor neural networks; ADAPTIVE NEURAL-CONTROL; COLLISION-AVOIDANCE; NONLINEAR-SYSTEMS;
D O I
10.1016/j.oceaneng.2023.114491
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
This paper investigates the distributed optimal formation tracking control problem based on backstepping technique and reinforcement learning for multiple underactuated autonomous underwater vehicles (AUVs). Based on the graph theory, we propose a virtual distributed formation tracking controller in the kinematics model while the barrier Lyapunov function is utilized to make sure the connectivity preservation and the collision avoidance. An optimal controller based on the reinforcement learning(RL) is designed to minimize a cost function in the dynamic motion, and critic-actor neural networks (NNs) are further applied for online implementation of the reinforcement learning algorithm. As a result, the optimal control design for the underactuated AUVs with the uncertain Hydrodynamic can be online realized. The command filter is adopted to solve the issue of the explosion of complexity. The simulation results are given to confirm the validity of the proposed method.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Integral reinforcement learning-based optimal tracking control for uncertain nonlinear systems under input constraint and specified performance constraints
    Chang, Ru
    Liu, Zhi-Meng
    Li, Xiao-Bin
    Sun, Chang-Yin
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (13) : 8802 - 8824
  • [32] Fuzzy Optimal Fault-Tolerant Trajectory Tracking Control of Underactuated AUVs With Prescribed Performance in 3-D Space
    Gong, Huibin
    Er, Meng Joo
    Liu, Yi
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (01): : 170 - 182
  • [33] Reinforcement learning-based consensus control for MASs with intermittent constraints
    Luo, Ao
    Zhou, Qi
    Ren, Hongru
    Ma, Hui
    Lu, Renquan
    NEURAL NETWORKS, 2024, 172
  • [34] Reinforcement learning-based trajectory tracking optimal control of unmanned surface vehicles in narrow water areas
    Wei, Ziping
    Du, Jialu
    ISA TRANSACTIONS, 2025, 159 : 152 - 164
  • [35] Safe model-based reinforcement learning for nonlinear optimal control with state and input constraints
    Kim, Yeonsoo
    Kim, Jong Woo
    AICHE JOURNAL, 2022, 68 (05)
  • [36] Optimal dynamic Control Allocation with guaranteed constraints and online Reinforcement Learning
    Kolaric, Patrik
    Lopez, Victor G.
    Lewis, Frank L.
    AUTOMATICA, 2020, 122
  • [37] Lyapunov-based distributed reinforcement learning control with stability guarantee
    Yao, Jingshi
    Han, Minghao
    Yin, Xunyuan
    COMPUTERS & CHEMICAL ENGINEERING, 2025, 195
  • [38] Deep reinforcement learning based finite-horizon optimal tracking control for nonlinear system
    Kim, Jong Woo
    Park, Byung Jun
    Yoo, Haeun
    Lee, Jay H.
    Lee, Jong Min
    IFAC PAPERSONLINE, 2018, 51 (25): : 257 - 262
  • [39] Optimal trajectory tracking control based on reinforcement learning for the deployment process of space tether system
    Feng, Yiting
    Wang, Changqing
    Li, Aijun
    IFAC PAPERSONLINE, 2020, 53 (01): : 679 - 684
  • [40] Reinforcement Learning-Based Optimal Fault-Tolerant Tracking Control of Industrial Processes
    Wang, Limin
    Li, Xueyu
    Zhang, Ridong
    Gao, Furong
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2023, 62 (39) : 16014 - 16024