Straight-Path Following for Underactuated Marine Vessels using Deep Reinforcement Learning

被引：54

作者：

Martinsen, Andreas B. ^{[1
]}

Lekkas, Anastasios M. ^{[1
]}

机构：

[1] Norwegian Univ Sci & Technol NTNU, Dept Engn Cybernet, NO-7491 Trondheim, Norway

来源：

IFAC PAPERSONLINE | 2018年 / 51卷 / 29期

关键词：

Deep reinforcement learning; path following; marine control systems; deep deterministic policy gradients;

D O I：

10.1016/j.ifacol.2018.09.502

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose a new framework, based on reinforcement learning, for solving the straight-path following problem for underactuated marine vessels under the influence of unknown ocean current. A dynamic model from the Marine Systems Simulator is employed to simulate the motion of a mariner-class vessel, however the policy search algorithm has no prior knowledge of the system it is assigned to control. A deep neural network is used as function approximator and the deep deterministic policy gradients method is employed to extract a suitable policy that minimizes the cross-track error. Two intuitive reward functions, which in addition prevent noisy rudder behavior, are proposed and compared. The simulation results demonstrate excellent performance, also in comparison with the line-of-sight guidance law. (C) 2018, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.

引用

页码：329 / 334

页数：6

共 20 条

[1] Trajectory-tracking and path-following of underactuated autonomous vehicles with parametric modeling uncertainty [J].

Aguiar, A. Pedro ;

Hespanha, Joao P. .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2007, 52 (08) :1362-1379

[2]

[Anonymous], 2016, P 4 INT C LEARN REPR

[3]

[Anonymous], 2013, Playing atari with deep reinforcement learning

[4]

[Anonymous], 1996, Neuro-dynamic programming

[5]

[Anonymous], 2015, Reinforcement Learning: An Introduction

[6]

Bertsekas D. P., 2012, DYNAMIC PROGRAMMING, VII

[7] Integral Line-of-Sight Guidance and Control of Underactuated Marine Vehicles: Theory, Simulations, and Experiments [J].

Caharija, Walter ;

Pettersen, Kristin Y. ;

Bibuli, Marco ;

Calado, Pedro ;

Zereik, Enrica ;

Braga, Jose ;

Gravdahl, Jan Tommy ;

Sorensen, Asgeir J. ;

Milovanovic, Milan ;

Bruzzone, Gabriele .

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2016, 24 (05) :1623-1642

[8]

Fossen T.I., 2011, HDB MARINE CRAFT HYD, DOI [10.1002/9781119994138, DOI 10.1002/9781119994138]

[9]

Fossen T.I., 2003, IFAC P VOLUMES, V36, P211, DOI 10.1016/S1474-6670(17)37809-6

[10]

Fossen T.I., 2004, Marine Systems Simulator (MSS)

← 1 2 →