Straight-Path Following for Underactuated Marine Vessels using Deep Reinforcement Learning

被引:50
|
作者
Martinsen, Andreas B. [1 ]
Lekkas, Anastasios M. [1 ]
机构
[1] Norwegian Univ Sci & Technol NTNU, Dept Engn Cybernet, NO-7491 Trondheim, Norway
来源
IFAC PAPERSONLINE | 2018年 / 51卷 / 29期
关键词
Deep reinforcement learning; path following; marine control systems; deep deterministic policy gradients;
D O I
10.1016/j.ifacol.2018.09.502
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a new framework, based on reinforcement learning, for solving the straight-path following problem for underactuated marine vessels under the influence of unknown ocean current. A dynamic model from the Marine Systems Simulator is employed to simulate the motion of a mariner-class vessel, however the policy search algorithm has no prior knowledge of the system it is assigned to control. A deep neural network is used as function approximator and the deep deterministic policy gradients method is employed to extract a suitable policy that minimizes the cross-track error. Two intuitive reward functions, which in addition prevent noisy rudder behavior, are proposed and compared. The simulation results demonstrate excellent performance, also in comparison with the line-of-sight guidance law. (C) 2018, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.
引用
收藏
页码:329 / 334
页数:6
相关论文
共 50 条
  • [21] PATH FOLLOWING CONTROL OF UNDERACTUATED MARINE VESSELS VIA DYNAMIC SURFACE CONTROL TECHNIQUE
    Oh, So-Ryeok
    Sun, Jing
    Li, Zhen
    PROCEEDINGS OF THE ASME DYNAMIC SYSTEMS AND CONTROL CONFERENCE 2008, PTS A AND B, 2009, : 81 - 88
  • [22] Path Following Control for UAV Using Deep Reinforcement Learning Approach
    Yintao Zhang
    Youmin Zhang
    Ziquan Yu
    Guidance,Navigation and Control, 2021, (01) : 95 - 112
  • [23] Path Following for Formations of Underactuated Marine Vessels under Influence of Constant Ocean Currents
    Belleter, D. J. W.
    Pettersen, K. Y.
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 4521 - 4528
  • [24] Practical Robust Neural Path Following Control for Underactuated Marine Vessels with Actuators Uncertainties
    Zhang, Guoqing
    Zhang, Xianku
    ASIAN JOURNAL OF CONTROL, 2017, 19 (01) : 173 - 187
  • [25] Path following of underactuated marine surface vessels using line-of-sight based model predictive control
    Oh, So-Ryeok
    Sun, Jing
    OCEAN ENGINEERING, 2010, 37 (2-3) : 289 - 295
  • [26] Straight Line Path Following for Formations of Underactuated Surface Vessels Under Influence of Constant Ocean Currents
    Burger, A.
    Pavlov, A.
    Borhaug, E.
    Pettersen, K. Y.
    2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009, : 3065 - 3070
  • [27] Learning visual path–following skills for industrial robot using deep reinforcement learning
    Guoliang Liu
    Wenlei Sun
    Wenxian Xie
    Yangyang Xu
    The International Journal of Advanced Manufacturing Technology, 2022, 122 : 1099 - 1111
  • [28] Path Following with Deep Reinforcement Learning for Autonomous Cars
    Alomari, Khaled
    Mendoza, Ricardo Carrillo
    Goehring, Daniel
    Rojas, Raul
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ROBOTICS, COMPUTER VISION AND INTELLIGENT SYSTEMS (ROBOVIS), 2021, : 173 - 181
  • [29] A Deep Reinforcement Learning Approach for Path Following on a Quadrotor
    Rubi, Bartomeu
    Morcego, Bernardo
    Perez, Ramon
    2020 EUROPEAN CONTROL CONFERENCE (ECC 2020), 2020, : 1092 - 1098
  • [30] Observer Based Path Following for Underactuated Marine Vessels in the Presence of Ocean Currents: A Local Approach
    Maghenem, M.
    Belleter, D. J. W.
    Paliotta, C.
    Pettersen, K. Y.
    IFAC PAPERSONLINE, 2017, 50 (01): : 13654 - 13661