Straight-Path Following for Underactuated Marine Vessels using Deep Reinforcement Learning

被引:50
|
作者
Martinsen, Andreas B. [1 ]
Lekkas, Anastasios M. [1 ]
机构
[1] Norwegian Univ Sci & Technol NTNU, Dept Engn Cybernet, NO-7491 Trondheim, Norway
来源
IFAC PAPERSONLINE | 2018年 / 51卷 / 29期
关键词
Deep reinforcement learning; path following; marine control systems; deep deterministic policy gradients;
D O I
10.1016/j.ifacol.2018.09.502
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a new framework, based on reinforcement learning, for solving the straight-path following problem for underactuated marine vessels under the influence of unknown ocean current. A dynamic model from the Marine Systems Simulator is employed to simulate the motion of a mariner-class vessel, however the policy search algorithm has no prior knowledge of the system it is assigned to control. A deep neural network is used as function approximator and the deep deterministic policy gradients method is employed to extract a suitable policy that minimizes the cross-track error. Two intuitive reward functions, which in addition prevent noisy rudder behavior, are proposed and compared. The simulation results demonstrate excellent performance, also in comparison with the line-of-sight guidance law. (C) 2018, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.
引用
收藏
页码:329 / 334
页数:6
相关论文
共 50 条
  • [41] Deep reinforcement learning for quadrotor path following with adaptive velocity
    Rubi, Bartomeu
    Morcego, Bernardo
    Perez, Ramon
    AUTONOMOUS ROBOTS, 2021, 45 (01) : 119 - 134
  • [42] Deep Reinforcement Learning for Concentric Tube Robot Path Following
    Iyengar, Keshav
    Spurgeon, Sarah
    Stoyanov, Danail
    IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2024, 6 (01): : 18 - 29
  • [43] Path Following for Autonomous Mobile Robots with Deep Reinforcement Learning
    Cao, Yu
    Ni, Kan
    Kawaguchi, Takahiro
    Hashimoto, Seiji
    SENSORS, 2024, 24 (02)
  • [44] Deep reinforcement learning for quadrotor path following with adaptive velocity
    Bartomeu Rubí
    Bernardo Morcego
    Ramon Pérez
    Autonomous Robots, 2021, 45 : 119 - 134
  • [45] Trajectory Tracking and Path Following for Underactuated Marine Vehicles
    Paliotta, Claudio
    Lefeber, Erjen
    Pettersen, Kristin Ytterstad
    Pinto, Jose
    Costa, Maria
    de Figueiredo Borges de Sousa, Joao Tasso
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2019, 27 (04) : 1423 - 1437
  • [46] Unmanned surface vessels path following control using improved transferring reinforcement learning: Simulation and experiment
    Liu, Jiaxuan
    Xiao, Changshi
    Yuan, Haiwen
    Li, Changlong
    Li, Qiliang
    OCEAN ENGINEERING, 2025, 320
  • [47] Predictor-Based Line-of-Sight Guidance Law for Path Following of Underactuated Marine Surface Vessels
    Liu, Lu
    Wang, Dan
    Peng, Zhouhua
    2015 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2015, : 284 - 288
  • [48] Two-time scale path following of underactuated marine surface vessels: Design and stability analysis using singular perturbation methods
    Yi, Bowen
    Qiao, Lei
    Zhang, Weidong
    OCEAN ENGINEERING, 2016, 124 : 287 - 297
  • [49] Deep Reinforcement Learning Algorithms for Steering an Underactuated Ship
    Le Pham Tuyen
    Layek, Abu
    Ngo Anh Vien
    Chung, TaeChoong
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS (MFI), 2017, : 602 - 607
  • [50] Taming an Autonomous Surface Vehicle for Path Following and Collision Avoidance Using Deep Reinforcement Learning
    Meyer, Eivind
    Robinson, Haakon
    Rasheed, Adil
    San, Omer
    IEEE ACCESS, 2020, 8 : 41466 - 41481