A path planning strategy unified with a COLREGS collision avoidance function based on deep reinforcement learning and artificial potential field

Cited by: 225

Authors:

Li, Lingyu [1]

Wu, Defeng [1,2]

Huang, Youqiang [1]

Yuan, Zhi-Ming [3]

Affiliations:

[1] Jimei Univ, Sch Marine Engn, Xiamen 361021, Peoples R China

[2] Fujian Prov Key Lab Naval Architecture & Ocean En, Xiamen 361021, Peoples R China

[3] Univ Strathclyde, Dept Naval Architecture Ocean & Marine Engn, Glasgow G4 0LZ, Lanark, Scotland

Source:

APPLIED OCEAN RESEARCH | 2021 / Volume 113

Funding:

National Natural Science Foundation of China;

Keywords:

Deep reinforcement learning; Path planning; Artificial potential field; COLREGS collision avoidance;

DOI:

10.1016/j.apor.2021.102759

Chinese Library Classification (CLC) number:

P75 [Ocean engineering];

Subject classification codes:

0814 ; 081505 ; 0824 ; 082401 ;

Abstract:

Improving the autopilot capability of ships is particularly important for ensuring the safety of maritime navigation, and unmanned surface vessels (USVs) with autopilot capability are a development trend for future ships. This paper investigates the path planning problem of USVs in uncertain environments and proposes a path planning strategy, unified with a collision avoidance function, based on deep reinforcement learning (DRL). A Deep Q-learning network (DQN) continuously interacts with a visually simulated environment to obtain experience data, so that the agent learns the best action strategies in that environment. To solve the collision avoidance problems that may occur during USV navigation, the location of the obstacle ship is divided into four collision avoidance zones according to the International Regulations for Preventing Collisions at Sea (COLREGS). To obtain an improved DRL algorithm, the artificial potential field (APF) algorithm is used to improve the action space and reward function of the DQN algorithm. Simulation experiments test the method in various situations and show that the enhanced DRL can effectively realize autonomous collision avoidance path planning.
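The abstract names two ingredients without giving formulas: an APF-informed reward and a four-zone partition of the obstacle ship's location. The Python sketch below illustrates one way these ideas could look. It is not the paper's formulation. The classic quadratic attractive and inverse-distance repulsive potentials, the gains `k_att` and `k_rep`, the influence radius `d0`, the zone names, and the 5-degree head-on half-width are all assumptions made for illustration; only the 112.5-degree to 247.5-degree stern sector follows the 22.5-degrees-abaft-the-beam wording of COLREGS Rule 13.

```python
import math

def apf_potential(pos, goal, obstacles, k_att=1.0, k_rep=100.0, d0=5.0):
    """Classic APF: quadratic attraction to the goal plus repulsion
    from every obstacle closer than d0. Gains are hypothetical."""
    u = 0.5 * k_att * math.dist(pos, goal) ** 2
    for obs in obstacles:
        d = max(math.dist(pos, obs), 1e-6)  # guard against division by zero
        if d <= d0:
            u += 0.5 * k_rep * (1.0 / d - 1.0 / d0) ** 2
    return u

def shaped_reward(pos, next_pos, goal, obstacles):
    """Reward a step by how much it lowers the potential
    (an assumed shaping scheme, not the paper's reward)."""
    return (apf_potential(pos, goal, obstacles)
            - apf_potential(next_pos, goal, obstacles))

def encounter_zone(rel_bearing_deg, head_on_half_width=5.0):
    """Classify the obstacle ship by its bearing relative to own bow,
    measured clockwise in degrees. Zone names and the head-on
    half-width are assumptions."""
    b = rel_bearing_deg % 360.0
    if b <= head_on_half_width or b >= 360.0 - head_on_half_width:
        return "head-on"
    if b < 112.5:
        return "crossing-starboard"
    if b <= 247.5:
        return "overtaking"  # obstacle lies in own ship's stern sector
    return "crossing-port"

if __name__ == "__main__":
    # A step toward the goal in open water earns a positive reward.
    print(shaped_reward((0, 0), (1, 0), (10, 0), []))
    # The same step toward a nearby obstacle is penalized.
    print(shaped_reward((0, 0), (1, 0), (10, 0), [(2, 0)]))
    print(encounter_zone(45.0))
```

In a DQN training loop, a term like `shaped_reward` would be added to the environment's sparse goal and collision rewards, and the zone label would select which avoidance behavior is rewarded.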

Pages: 16

References: 39 in total
