Straight-Path Following for Underactuated Marine Vessels using Deep Reinforcement Learning

被引:50
|
作者
Martinsen, Andreas B. [1 ]
Lekkas, Anastasios M. [1 ]
机构
[1] Norwegian Univ Sci & Technol NTNU, Dept Engn Cybernet, NO-7491 Trondheim, Norway
来源
IFAC PAPERSONLINE | 2018年 / 51卷 / 29期
关键词
Deep reinforcement learning; path following; marine control systems; deep deterministic policy gradients;
D O I
10.1016/j.ifacol.2018.09.502
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a new framework, based on reinforcement learning, for solving the straight-path following problem for underactuated marine vessels under the influence of unknown ocean current. A dynamic model from the Marine Systems Simulator is employed to simulate the motion of a mariner-class vessel, however the policy search algorithm has no prior knowledge of the system it is assigned to control. A deep neural network is used as function approximator and the deep deterministic policy gradients method is employed to extract a suitable policy that minimizes the cross-track error. Two intuitive reward functions, which in addition prevent noisy rudder behavior, are proposed and compared. The simulation results demonstrate excellent performance, also in comparison with the line-of-sight guidance law. (C) 2018, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.
引用
收藏
页码:329 / 334
页数:6
相关论文
共 50 条
  • [1] Straight-Path Following and Formation Control of USVs Using Distributed Deep Reinforcement Learning and Adaptive Neural Network
    Zhengqing Han
    Yintao Wang
    Qi Sun
    IEEE/CAA Journal of Automatica Sinica, 2023, 10 (02) : 572 - 574
  • [2] Straight-Path Following and Formation Control of USVs Using Distributed Deep Reinforcement Learning and Adaptive Neural Network
    Han, Zhengqing
    Wang, Yintao
    Sun, Qi
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (02) : 572 - 574
  • [3] Straight Line Path Following for Formations of Underactuated Marine Surface Vessels
    Borhaug, Even
    Pavlov, Alexey
    Panteley, Elena
    Pettersen, Kristin Y.
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2011, 19 (03) : 493 - 506
  • [4] Path Following for Underactuated Marine Vessels
    Belleter, D. J. W.
    Paliotta, C.
    Maggiore, M.
    Pettersen, K. Y.
    IFAC PAPERSONLINE, 2016, 49 (18): : 588 - 593
  • [5] Straight-path tracking control of underactuated ships using dynamic nonlinear Sliding
    Navigation College, Dalian Maritime University, Dalian 116026, China
    Qinghua Daxue Xuebao, 2007, SUPPL. 2 (1880-1883):
  • [6] Deep reinforcement learning-based path planning of underactuated surface vessels
    Xu H.
    Wang N.
    Zhao H.
    Zheng Z.
    Cyber-Physical Systems, 2019, 5 (01): : 1 - 17
  • [7] Concise deep reinforcement learning obstacle avoidance for underactuated unmanned marine vessels
    Cheng, Yin
    Zhang, Weidong
    NEUROCOMPUTING, 2018, 272 : 63 - 73
  • [8] Path Following Optimization for an Underactuated USV Using Smoothly-Convergent Deep Reinforcement Learning
    Zhao, Yujiao
    Qi, Xin
    Ma, Yong
    Li, Zhixiong
    Malekian, Reza
    Sotelo, Miguel Angel
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (10) : 6208 - 6220
  • [9] Straight-path Tracking Control of Underactuated Ship Based on Backstepping Method
    Liu Yong
    Bu Ren-Xiang
    Xu Hai-Jun
    2015 NINTH INTERNATIONAL CONFERENCE ON FRONTIER OF COMPUTER SCIENCE AND TECHNOLOGY FCST 2015, 2015, : 223 - 227
  • [10] Multi-path Following for Underactuated USV Based on Deep Reinforcement Learning
    Wang, Zihao
    Wu, Yaoxin
    Song, Wen
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 3525 - 3535