Straight-Path Following for Underactuated Marine Vessels using Deep Reinforcement Learning

被引:54
作者
Martinsen, Andreas B. [1 ]
Lekkas, Anastasios M. [1 ]
机构
[1] Norwegian Univ Sci & Technol NTNU, Dept Engn Cybernet, NO-7491 Trondheim, Norway
关键词
Deep reinforcement learning; path following; marine control systems; deep deterministic policy gradients;
D O I
10.1016/j.ifacol.2018.09.502
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a new framework, based on reinforcement learning, for solving the straight-path following problem for underactuated marine vessels under the influence of unknown ocean current. A dynamic model from the Marine Systems Simulator is employed to simulate the motion of a mariner-class vessel, however the policy search algorithm has no prior knowledge of the system it is assigned to control. A deep neural network is used as function approximator and the deep deterministic policy gradients method is employed to extract a suitable policy that minimizes the cross-track error. Two intuitive reward functions, which in addition prevent noisy rudder behavior, are proposed and compared. The simulation results demonstrate excellent performance, also in comparison with the line-of-sight guidance law. (C) 2018, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.
引用
收藏
页码:329 / 334
页数:6
相关论文
共 20 条
[1]   Trajectory-tracking and path-following of underactuated autonomous vehicles with parametric modeling uncertainty [J].
Aguiar, A. Pedro ;
Hespanha, Joao P. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2007, 52 (08) :1362-1379
[2]  
[Anonymous], 2016, P 4 INT C LEARN REPR
[3]  
[Anonymous], 2013, Playing atari with deep reinforcement learning
[4]  
[Anonymous], 1996, Neuro-dynamic programming
[5]  
[Anonymous], 2015, Reinforcement Learning: An Introduction
[6]  
Bertsekas D. P., 2012, DYNAMIC PROGRAMMING, VII
[7]   Integral Line-of-Sight Guidance and Control of Underactuated Marine Vehicles: Theory, Simulations, and Experiments [J].
Caharija, Walter ;
Pettersen, Kristin Y. ;
Bibuli, Marco ;
Calado, Pedro ;
Zereik, Enrica ;
Braga, Jose ;
Gravdahl, Jan Tommy ;
Sorensen, Asgeir J. ;
Milovanovic, Milan ;
Bruzzone, Gabriele .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2016, 24 (05) :1623-1642
[8]  
Fossen T.I., 2011, HDB MARINE CRAFT HYD, DOI [10.1002/9781119994138, DOI 10.1002/9781119994138]
[9]  
Fossen T.I., 2003, IFAC P VOLUMES, V36, P211, DOI 10.1016/S1474-6670(17)37809-6
[10]  
Fossen T.I., 2004, Marine Systems Simulator (MSS)