Deep reinforcement learning with dynamic window approach based collision avoidance path planning for maritime autonomous surface ships

被引:42
作者
Wu, Chuanbo [1 ,3 ]
Yu, Wangneng [1 ,2 ,3 ]
Li, Guangze [1 ,3 ]
Liao, Weiqiang [1 ,2 ,3 ]
机构
[1] Jimei Univ, Sch Marine Engn, Xiamen 361021, Peoples R China
[2] Fujian Prov Key Lab Naval Architecture & Ocean Eng, Xiamen 361021, Peoples R China
[3] Fujian Engn & Res Ctr Offshore Small Green Intelli, Xiamen 361021, Peoples R China
基金
中国国家自然科学基金;
关键词
Ship collision avoidance; Dynamic window approach; Deep reinforcement learning; Maritime autonomous surface ships; OPTIMIZATION; ALGORITHM;
D O I
10.1016/j.oceaneng.2023.115208
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Automatic obstacle avoidance technology is one of the key technologies for ship intelligence. The purpose of this paper is to investigate the obstacle avoidance problem of maritime autonomous surface ships(MASS) in a complex offshore environment, and an obstacle avoidance strategy based on deep reinforcement learning and a dynamic window algorithm was proposed. To solve the collision avoidance problems that may occur during intelligent ship navigation, the action space of the proximal policy optimization (PPO) algorithm is defined according to the description of ship motion by linear and angular velocity in the dynamic window approach (DWA). The maximum detection distance of the MASS is utilized to construct the ship safety domain, which determines the state space containing the information of this ship and the nearest obstacle. To solve the problem of sparse reward, the reward function of the PPO is improved by combining the evaluation functions for distance, velocity and heading in the DWA. To verify the effectiveness of the algorithm, simulation experiments are performed in various situations. It is also shown that the improved algorithm can make the optimal collision avoidance decision from the complex environment and can effectively realize autonomous collision avoidance path planning for the MASS.
引用
收藏
页数:16
相关论文
共 35 条
[1]   Internet of Ships: A Survey on Architectures, Emerging Applications, and Challenges [J].
Aslam, Sheraz ;
Michaelides, Michalis P. ;
Herodotou, Herodotos .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (10) :9714-9727
[2]   A knowledge-free path planning approach for smart ships based on reinforcement learning [J].
Chen, Chen ;
Chen, Xian-Qiao ;
Ma, Feng ;
Zeng, Xiao-Jun ;
Wang, Jin .
OCEAN ENGINEERING, 2019, 189
[3]   Path Planning and Obstacle Avoiding of the USV Based on Improved ACO-APF Hybrid Algorithm With Adaptive Early-Warning [J].
Chen, Yanli ;
Bai, Guiqiang ;
Zhan, Yin ;
Hu, Xinyu ;
Liu, Jun .
IEEE ACCESS, 2021, 9 :40728-40742
[4]   Deep reinforcement learning-based collision avoidance for an autonomous ship [J].
Chun, Do-Hyun ;
Roh, Myung-Il ;
Lee, Hye-Won ;
Ha, Jisang ;
Yu, Donghun .
OCEAN ENGINEERING, 2021, 234
[5]  
Chunyu Ju, 2020, 2020 11th International Conference on Prognostics and System Health Management (PHM-2020 Jinan), P23, DOI 10.1109/PHM-Jinan48558.2020.00012
[6]  
Fossen T.I., 2003, IFAC Proc.Vol, V36, P211, DOI 10.1016/S1474-6670(17)37809-6
[7]   An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning [J].
Guo, Siyu ;
Zhang, Xiuguo ;
Zheng, Yisong ;
Du, Yiquan .
SENSORS, 2020, 20 (02)
[8]   Global path planning and multi-objective path control for unmanned surface vehicle based on modified particle swarm optimization (PSO) algorithm [J].
Guo, Xinghai ;
Ji, Mingjun ;
Zhao, Ziwei ;
Wen, Dusu ;
Zhang, Weidan .
OCEAN ENGINEERING, 2020, 216
[9]   Dynamic anti-collision A-star algorithm for multi-ship encounter situations [J].
He, Zhibo ;
Liu, Chenguang ;
Chu, Xiumin ;
Negenborn, Rudy R. ;
Wu, Qing .
APPLIED OCEAN RESEARCH, 2022, 118
[10]   Reinforcement Learning-Based Collision Avoidance and Optimal Trajectory Planning in UAV Communication Networks [J].
Hsu, Yu-Hsin ;
Gau, Rung-Hung .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (01) :306-320