Path-Following Control of Unmanned Underwater Vehicle Based on an Improved TD3 Deep Reinforcement Learning

被引:10
|
作者
Fan, Yexin [1 ]
Dong, Hongyang [1 ]
Zhao, Xiaowei [1 ]
Denissenko, Petr [2 ]
机构
[1] Univ Warwick, Sch Engn, Intelligent Control & Smart Energy ICSE Res Grp, Coventry CV4 7AL, England
[2] Univ Warwick, Sch Engn, Coventry CV4 7AL, England
基金
英国工程与自然科学研究理事会;
关键词
Vehicle dynamics; Attitude control; Uncertainty; Training; Oscillators; Convergence; Complexity theory; Deep reinforcement learning (DRL); intelligent control; path-following control; unmanned underwater vehicles (UUVs); ADAPTIVE-CONTROL; TRACKING CONTROL; AUV;
D O I
10.1109/TCST.2024.3377876
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work proposes an innovative path-following control method, anchored in deep reinforcement learning (DRL), for unmanned underwater vehicles (UUVs). This approach is driven by several new designs, all of which aim to enhance learning efficiency and effectiveness and achieve high-performance UUV control. Specifically, a novel experience replay strategy is designed and integrated within the twin-delayed deep deterministic policy gradient algorithm (TD3). It distinguishes the significance of stored transitions by making a trade-off between rewards and temporal-difference (TD) errors, thus enabling the UUV agent to explore optimal control policies more efficiently. Another major challenge within this control problem arises from action oscillations associated with DRL policies. This issue leads to excessive system wear on actuators and makes real-time application difficult. To mitigate this challenge, a newly improved regularization method is proposed, which provides a moderate level of smoothness to the control policy. Furthermore, a dynamic reward function featuring adaptive constraints is designed to avoid unproductive exploration and expedite learning convergence speed further. Simulation results show that our method garners higher rewards in fewer training episodes compared with mainstream DRL-based control approaches (e.g., deep deterministic policy gradient (DDPG) and vanilla TD3) in UUV applications. Moreover, it can adapt to varying path configurations amid uncertainties and disturbances, all while ensuring high tracking accuracy. Simulation and experimental studies are conducted to verify the effectiveness.
引用
收藏
页码:1904 / 1919
页数:16
相关论文
共 50 条
  • [1] Path-following optimal control of autonomous underwater vehicle based on deep reinforcement learning
    Wang, Zhanyuan
    Li, Yulong
    Ma, Caipeng
    Yan, Xun
    Jiang, Dapeng
    OCEAN ENGINEERING, 2023, 268
  • [2] THREE-DIMENSIONAL PATH-FOLLOWING CONTROL OF AN AUTONOMOUS UNDERWATER VEHICLE BASED ON DEEP REINFORCEMENT LEARNING
    Liang, Zhenyu
    Qu, Xingru
    Zhang, Zhao
    Chen, Cong
    POLISH MARITIME RESEARCH, 2022, 29 (04) : 36 - 44
  • [3] USV Path-Following Control Based On Deep Reinforcement Learning and Adaptive Control
    Gonzalez-Garcia, Alejandro
    Castaneda, Herman
    Garrido, Leonardo
    GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
  • [4] Path Planning Technology of Unmanned Vehicle Based on Improved Deep Reinforcement Learning
    Zhang, Kai
    Wang, Guile
    Hu, Jinwen
    Xu, Zhao
    Guo, Chubing
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8392 - 8397
  • [5] Deep Reinforcement Learning With Multicritic TD3 for Decentralized Multirobot Path Planning
    Yin, Heqing
    Wang, Chang
    Yan, Chao
    Xiang, Xiaojia
    Cai, Boliang
    Wei, Changyun
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (04) : 1233 - 1247
  • [6] Deep reinforcement learning-based controller for path following of an unmanned surface vehicle
    Woo, Joohyun
    Yu, Chanwoo
    Kim, Nakwan
    OCEAN ENGINEERING, 2019, 183 : 155 - 166
  • [7] Single degree of freedom control based on deep reinforcement learning for underwater unmanned vehicle
    Li, Nan
    Zhao, Changming
    Shi, Yan
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2024,
  • [8] A novel energy management strategy of hybrid electric vehicle via an improved TD3 deep reinforcement learning
    Zhou, Jianhao
    Xue, Siwu
    Xue, Yuan
    Liao, Yuhui
    Liu, Jun
    Zhao, Wanzhong
    ENERGY, 2021, 224
  • [9] Research on Path-Following Technology of a Single-Outboard-Motor Unmanned Surface Vehicle Based on Deep Reinforcement Learning and Model Predictive Control Algorithm
    Cui, Bin
    Chen, Yuanming
    Hong, Xiaobin
    Luo, Hao
    Chen, Guanqiao
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (12)
  • [10] Deep Interactive Reinforcement Learning for Path Following of Autonomous Underwater Vehicle
    Zhang, Qilei
    Lin, Jinying
    Sha, Qixin
    He, Bo
    Li, Guangliang
    IEEE ACCESS, 2020, 8 : 24258 - 24268