Heading Control of a Ship Based on Deep Reinforcement Learning (RL)

被引:6
作者
Sivaraj, Sivaraman [1 ]
Rajendran, Suresh [1 ]
机构
[1] Indian Inst Technol Madras, Dept Ocean Engn, Chennai 600036, Tamil Nadu, India
来源
OCEANS 2022 | 2022年
关键词
Heading Control; LOS; KVLCC2; Salvenson Method; Cross Track Error; DQN;
D O I
10.1109/OCEANSChennai45887.2022.9775236
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
A deep reinforcement learning (RL) algorithm called Deep Q-Network(DQN) is used for the heading control of a ship in calm water and in waves. Ship's state space is in continuous space and a set of five rudder angle actions are in discrete space. The optimal rudder action is selected based on maximum Q-value of the rudder actions. The ship positions and velocities serve as input and the Q-values associated with a set of rudder angles are the output of the DQN. Reward functions are designed such that the agent will try to reduce the Cross Track Error (CTE) and Heading Error (HE). The heading control of a KVLCC2 tanker in calm water and waves is investigated. The ship dynamics is represented using a 3DoF numerical model. The CTE and HE are calculated based on Line of Sight (LOS) Algorithm.
引用
收藏
页数:6
相关论文
共 13 条
[1]  
[Anonymous], 2013, PROC INT C NEURAL IN
[2]  
De S., 2018, ARXIV180706766
[3]  
Kingma D. P., 2015, P INT C LEARN REPR, P1
[4]  
Lekkas A. M., 2013, Advanced in marine robotics, V5, P63
[5]   Reinforcement Learning-Based Tracking Control of USVs in Varying Operational Conditions [J].
Martinsen, Andreas B. ;
Lekkas, Anastasios M. ;
Gros, Sebastien ;
Glomsrud, Jon Arne ;
Pedersen, Tom Arne .
FRONTIERS IN ROBOTICS AND AI, 2020, 7
[6]   Straight-Path Following for Underactuated Marine Vessels using Deep Reinforcement Learning [J].
Martinsen, Andreas B. ;
Lekkas, Anastasios M. .
IFAC PAPERSONLINE, 2018, 51 (29) :329-334
[7]   Path following control system for a tanker ship model [J].
Moreira, Lucia ;
Fossen, Thor I. ;
Soares, C. Guedes .
OCEAN ENGINEERING, 2007, 34 (14-15) :2074-2085
[8]   A unified seakeeping and manoeuvring model with a PID controller for path following of a KVLCC2 tanker in regular waves [J].
Paramesh, S. ;
Rajendran, Suresh .
APPLIED OCEAN RESEARCH, 2021, 116
[9]  
Salvesen N., 1974, INT S DYN MAR VEH ST
[10]   A unified ship manoeuvring model with a nonlinear model predictive controller for path following in regular waves [J].
Sandeepkumar, R. ;
Rajendran, Suresh ;
Mohan, Ranjith ;
Pascoal, Antonio .
OCEAN ENGINEERING, 2022, 243