Speed and heading control of an unmanned surface vehicle using deep reinforcement learning

Times cited: 3
Authors
Wu, Ting [1]
Ye, Hui [1]
Xiang, Zhengrong [2]
Yang, Xiaofei [1]
Affiliations
[1] Jiangsu Univ Sci & Technol, Sch Automat, Zhenjiang 212100, Jiangsu, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Automat, Nanjing 210094, Peoples R China
Source
2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS | 2023
Funding
National Natural Science Foundation of China;
Keywords
Deep reinforcement learning; DDPG algorithm; unmanned surface vehicle;
DOI
10.1109/DDCLS58216.2023.10166143
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
In this paper, a deep reinforcement learning-based speed and heading control method is proposed for an unmanned surface vehicle (USV). A deep deterministic policy gradient (DDPG) algorithm, which is built on an actor-critic reinforcement learning mechanism, is adopted to produce continuous control variables through interaction with the environment. Moreover, two types of reward functions are designed, one for speed control and one for heading control of the USV. The control policy is trained by trial and error so that the USV is guided to reach the desired speed and heading angle steadily and rapidly. Simulation results verify the feasibility and effectiveness of the proposed approach through comparison with classical PID control and S-plane control.
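The abstract does not state the form of the two reward functions or of the S-plane baseline. As a rough illustration only, the Python sketch below shows one plausible shape for tracking rewards (negative absolute error, with the heading error wrapped to [-pi, pi)) and the S-plane control law as it is commonly written in the literature, u = 2 / (1 + exp(-k1*e - k2*de)) - 1. The function names, the `scale`, `k1`, and `k2` parameters, and the reward shapes are all assumptions for illustration and are not taken from the paper.

```python
import math


def wrap_angle(angle):
    """Wrap an angle in radians to the interval [-pi, pi)."""
    return (angle + math.pi) % (2.0 * math.pi) - math.pi


def speed_reward(speed, target_speed, scale=1.0):
    """Hypothetical speed reward: 0 at zero error, more negative as error grows."""
    return -scale * abs(target_speed - speed)


def heading_reward(heading, target_heading, scale=1.0):
    """Hypothetical heading reward using the shortest angular error."""
    error = wrap_angle(target_heading - heading)
    return -scale * abs(error)


def s_plane_control(error, error_rate, k1=1.0, k2=1.0):
    """S-plane control law, normalized output in (-1, 1).

    Written with tanh, which equals 2 / (1 + exp(-x)) - 1 for
    x = k1 * error + k2 * error_rate and avoids overflow in exp.
    The gains k1 and k2 are placeholder values, not tuned ones.
    """
    return math.tanh(0.5 * (k1 * error + k2 * error_rate))
```

In a DDPG training loop, a reward of this kind would be evaluated at every simulation step from the current and desired speed or heading, while the S-plane function would serve only as a comparison controller.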
Pages: 573-578
Number of pages: 6