A partially observable multi-ship collision avoidance decision-making model based on deep reinforcement learning

被引:19
作者
Zheng, Kangjie [1 ]
Zhang, Xinyu [1 ]
Wang, Chengbo [1 ]
Zhang, Mingyang [2 ]
Cui, Hao [1 ]
机构
[1] Dalian Maritime Univ, Dept Nav Coll, Maritime Intelligent Transportat Res Team, Dalian, Peoples R China
[2] Aalto Univ, Dept Mech Engn, Marine Technol Grp, Espoo, Finland
关键词
Collision avoidance decision-making; POMDP; Dense reward; G-IPOMDP-PPO; Mixed obstacle environments; NAVIGATION;
D O I
10.1016/j.ocecoaman.2023.106689
中图分类号
P7 [海洋学];
学科分类号
0707 ;
摘要
Unmanned ships have drawn widespread attention for their potential to enhance navigational safety, minimize human errors, and improve shipping efficiency. Nevertheless, the complexity and uncertainty of mixed obstacle environments present significant challenges to developing unmanned ships, particularly in collision avoidance decision-making. This paper proposes a new model using the Partially Observable Markov Decision Process (POMDP) to construct a collision avoidance decision-making model in mixed obstacle environments for autonomous ships, which can address the environment's complexity and uncertainty and improve decision accuracy. An image-state observation method is proposed as images can provide more accurate, rich, and reliable information. A dense reward function is designed to address the issue of sparse rewards in fitting the algorithm. The Proximal Policy Optimization (PPO) algorithm is utilized for model training. Based on this, a route guidance method called the PPO for POMDP with guidelines under dense reward (G-IPOMDP-PPO) is proposed, which can improve training efficiency. Simulations are conducted in various mixed obstacle environments and compared with conventional algorithms. The results show that the proposed model can safely and efficiently make collision avoidance decisions in complex and uncertain environments. This research provides a new solution and theoretical foundation for developing autonomous ships and can be extended to achieving dynamic interactive collision avoidance in mixed obstacle environments.
引用
收藏
页数:13
相关论文
共 52 条
[1]   Nonlinear Model Predictive Control for trajectory tracking and collision avoidance of underactuated vessels with disturbances [J].
Abdelaal, Mohamed ;
Fraenzle, Martin ;
Hahn, Axel .
OCEAN ENGINEERING, 2018, 160 :168-180
[2]   Fullest COLREGs Evaluation Using Fuzzy Logic for Collaborative Decision-Making Analysis of Autonomous Ships in Complex Situations [J].
Bakdi, Azzeddine ;
Vanem, Erik .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) :18433-18445
[3]   Identifying factors influencing total-loss marine accidents in the world: Analysis and evaluation based on ship types and sea regions [J].
Chen, Jihong ;
Bian, Wentao ;
Wan, Zheng ;
Yang, Zaili ;
Zheng, Huiying ;
Wang, Paopao .
OCEAN ENGINEERING, 2019, 191
[4]   The relationship between the development of global maritime fleets and GHG emission from shipping [J].
Chen, Jihong ;
Fei, Yijie ;
Wan, Zheng .
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2019, 242 :31-39
[5]   Deep reinforcement learning-based collision avoidance for an autonomous ship [J].
Chun, Do-Hyun ;
Roh, Myung-Il ;
Lee, Hye-Won ;
Ha, Jisang ;
Yu, Donghun .
OCEAN ENGINEERING, 2021, 234
[6]   Collision risk assessment for ships? routeing waters: An information entropy approach with Automatic Identification System (AIS) data [J].
Feng, Hongxiang ;
Grifoll, Manel ;
Yang, Zhongzhen ;
Zheng, Pengjun .
OCEAN & COASTAL MANAGEMENT, 2022, 224
[7]   Hybrid Dec-POMDP/PID Guidance System for Formation Flight of Multiple UAVs [J].
Floriano, Bruno R. O. ;
Borges, Geovany A. ;
Ferreira, Henrique C. ;
Ishihara, Joao Y. .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 101 (03)
[8]   Research on ship collision avoidance path planning based on modified potential field ant colony algorithm [J].
Gao, Pan ;
Zhou, Li ;
Zhao, Xu ;
Shao, Bo .
OCEAN & COASTAL MANAGEMENT, 2023, 235
[9]   Scenario-Based Model Predictive Control with Several Steps for COLREGS Compliant Ship Collision Avoidance [J].
Hagen, I. B. ;
Kufoalor, D. K. M. ;
Johansen, T. A. ;
Brekke, E. F. .
IFAC PAPERSONLINE, 2022, 55 (31) :307-312
[10]  
Hu J., 2022, Industrial Engineering Innovation and Management, V5, P20