Reinforcement learning-based optimal trajectory tracking control of surface vessels under input saturations

被引:10
|
作者
Wei, Ziping [1 ]
Du, Jialu [1 ,2 ]
机构
[1] Dalian Maritime Univ, Sch Marine Elect Engn, Dalian, Peoples R China
[2] Dalian Maritime Univ, Sch Marine Engn, Dalian 116026, Liaoning, Peoples R China
基金
中国国家自然科学基金;
关键词
input saturations; reinforcement learning; surface vessels; trajectory tracking; unknown dynamics; WAVE ENERGY CONVERTERS; ADAPTIVE NN CONTROL; NONLINEAR-SYSTEMS; CONTROL ALGORITHM; UNKNOWN DYNAMICS; CONTROL DESIGN; ROBUST-CONTROL;
D O I
10.1002/rnc.6597
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper develops a reinforcement learning (RL)-based optimal trajectory tracking control scheme of surface vessels with unknown dynamics, unknown disturbances, and input saturations of surface vessels. The control scheme is designed by combining the optimal control theory, adaptive neural networks, and the RL method in a unified actor-critic NN framework. A hyperbolic-type penalty function of the control input is designed so as to deal with the input saturations of surface vessels. An actor-critic NN-based RL mechanism is established to learn the optimal trajectory tracking control law without the knowledge of the surface vessel dynamics and disturbances, where NN weights are tuned online on the basis of devised tuning laws. Theoretical analysis and simulation results prove that the proposed RL-based optimal trajectory tracking control scheme can ensure surface vessels track the desired trajectory, while guaranteeing the boundedness of all signals in the surface vessel optimal trajectory tracking closed-loop control system.
引用
收藏
页码:3807 / 3825
页数:19
相关论文
共 50 条
  • [1] Reinforcement learning-based trajectory tracking optimal control of unmanned surface vehicles in narrow water areas
    Wei, Ziping
    Du, Jialu
    ISA TRANSACTIONS, 2025, 159 : 152 - 164
  • [2] Robust trajectory tracking control of marine surface vessels with uncertain disturbances and input saturations
    Liu, Haitao
    Chen, Guangjun
    NONLINEAR DYNAMICS, 2020, 100 (04) : 3513 - 3528
  • [3] Reinforcement Learning-Based Optimal Tracking Control of an Unknown Unmanned Surface Vehicle
    Wang, Ning
    Gao, Ying
    Zhao, Hong
    Ahn, Choon Ki
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (07) : 3034 - 3045
  • [4] Robust Adaptive Trajectory Linearization Control for Tracking Control of Surface Vessels With Modeling Uncertainties Under Input Saturation
    Qiu, Bingbing
    Wang, Guofeng
    Fan, Yunsheng
    Mu, Dongdong
    Sun, Xiaojie
    IEEE ACCESS, 2019, 7 : 5057 - 5070
  • [5] Reinforcement Learning-Based Fixed-Time Trajectory Tracking Control for Uncertain Robotic Manipulators With Input Saturation
    Cao, Shengjie
    Sun, Liang
    Jiang, Jingjing
    Zuo, Zongyu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4584 - 4595
  • [6] Reinforcement learning-based event-triggered optimal control for unknown nonlinear systems with input delay
    Chen, Xiangyu
    Sun, Weiwei
    Gao, Xinci
    Li, Yongshu
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (07) : 4844 - 4863
  • [7] Prescribed performance-based tracking control for quadrotor UAV under input delays and input saturations
    Zhu, Bing
    Chen, Mou
    Li, Tao
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2022, 44 (10) : 2049 - 2062
  • [8] Smooth reinforcement learning-based trajectory tracking for articulated vehicles
    Chen, Liangfa
    Song, Xujie
    Xiao, Liming
    Gao, Lulu
    Zhang, Fawang
    Li, Shengbo
    Ma, Fei
    Duan, Jingliang
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2024, 56 (12): : 116 - 123
  • [9] Online learning control of surface vessels for fine trajectory tracking
    Li, Guoyuan
    Li, Wei
    Hildre, Hans Petter
    Zhang, Houxiang
    JOURNAL OF MARINE SCIENCE AND TECHNOLOGY, 2016, 21 (02) : 251 - 260
  • [10] Online learning control of surface vessels for fine trajectory tracking
    Guoyuan Li
    Wei Li
    Hans Petter Hildre
    Houxiang Zhang
    Journal of Marine Science and Technology, 2016, 21 : 251 - 260