Reinforcement learning-based optimal trajectory tracking control of surface vessels under input saturations

被引:10
作者
Wei, Ziping [1 ]
Du, Jialu [1 ,2 ]
机构
[1] Dalian Maritime Univ, Sch Marine Elect Engn, Dalian, Peoples R China
[2] Dalian Maritime Univ, Sch Marine Engn, Dalian 116026, Liaoning, Peoples R China
基金
中国国家自然科学基金;
关键词
input saturations; reinforcement learning; surface vessels; trajectory tracking; unknown dynamics; WAVE ENERGY CONVERTERS; ADAPTIVE NN CONTROL; NONLINEAR-SYSTEMS; CONTROL ALGORITHM; UNKNOWN DYNAMICS; CONTROL DESIGN; ROBUST-CONTROL;
D O I
10.1002/rnc.6597
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper develops a reinforcement learning (RL)-based optimal trajectory tracking control scheme of surface vessels with unknown dynamics, unknown disturbances, and input saturations of surface vessels. The control scheme is designed by combining the optimal control theory, adaptive neural networks, and the RL method in a unified actor-critic NN framework. A hyperbolic-type penalty function of the control input is designed so as to deal with the input saturations of surface vessels. An actor-critic NN-based RL mechanism is established to learn the optimal trajectory tracking control law without the knowledge of the surface vessel dynamics and disturbances, where NN weights are tuned online on the basis of devised tuning laws. Theoretical analysis and simulation results prove that the proposed RL-based optimal trajectory tracking control scheme can ensure surface vessels track the desired trajectory, while guaranteeing the boundedness of all signals in the surface vessel optimal trajectory tracking closed-loop control system.
引用
收藏
页码:3807 / 3825
页数:19
相关论文
共 50 条
  • [31] A neuro-observer-based optimal control for nonaffine nonlinear systems with control input saturations
    Farzanegan, Behzad
    Zamani, Mohsen
    Suratgar, Amir Abolfazl
    Menhaj, Mohammad Bagher
    [J]. CONTROL THEORY AND TECHNOLOGY, 2021, 19 (02) : 283 - 294
  • [32] Recursive Slidingmode Dynamic Surface Adaptive Control for Surface Vessels Trajectory Tracking with Input Saturation
    Bi, Yannan
    Shen, Zhipeng
    Yu, Haomiao
    Guo, Chen
    [J]. PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 4600 - 4607
  • [33] A neuro-observer-based optimal control for nonaffine nonlinear systems with control input saturations
    Behzad Farzanegan
    Mohsen Zamani
    Amir Abolfazl Suratgar
    Mohammad Bagher Menhaj
    [J]. Control Theory and Technology, 2021, 19 : 283 - 294
  • [34] Heuristic and deep reinforcement learning-based PID control of trajectory tracking in a ball-and-plate system
    Okafor, Emmanuel
    Udekwe, Daniel
    Ibrahim, Yusuf
    Mu'azu, Muhammed Bashir
    Okafor, Ekene Gabriel
    [J]. JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2021, 5 (02) : 179 - 196
  • [36] Machine learning-based robust trajectory tracking control for FSGR
    Jia, Lin
    Wang, Yaonan
    Zhang, Changfan
    Zhao, Kaihui
    Zhou, Langming
    [J]. JOURNAL OF ENGINEERING-JOE, 2019, 2019 (23): : 9220 - 9225
  • [37] Learning-based parametrized model predictive control for trajectory tracking
    Sferrazza, Carmelo
    Muehlebach, Michael
    D'Andrea, Raffaello
    [J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 2020, 41 (06) : 2225 - 2249
  • [38] Adaptive neural network-based fault-tolerant trajectory-tracking control of unmanned surface vessels with input saturation and error constraints
    Qin, Hongde
    Li, Chengpeng
    Sun, Yanchao
    [J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (05) : 356 - 363
  • [39] Reinforcement Learning-Based Optimal Battery Control Under Cycle-Based Degradation Cost
    Kwon, Kyung-bin
    Zhu, Hao
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (06) : 4909 - 4917
  • [40] Reinforcement learning-based tracking control for AUVs subject to disturbances
    Wang, Guangcang
    Zhang, Dianfeng
    Wu, Zhaojing
    [J]. 2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 2222 - 2227