Optimizing Multi-Vessel Collision Avoidance Decision Making for Autonomous Surface Vessels: A COLREGs-Compliant Deep Reinforcement Learning Approach

被引:12
作者
Xie, Weidong [1 ]
Gang, Longhui [1 ]
Zhang, Mingheng [2 ]
Liu, Tong [1 ]
Lan, Zhixun [1 ]
机构
[1] Dalian Maritime Univ, Coll Nav, Dalian 116026, Peoples R China
[2] Dalian Univ Technol, Sch Mech Engn, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
automatic collision avoidance decision making; multi-ship encounter situations; deep reinforcement learning; COLREGs; MARINE VEHICLES; FUZZY-LOGIC; NAVIGATION; SYSTEM;
D O I
10.3390/jmse12030372
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Automatic collision avoidance decision making for vessels is a critical challenge in the development of autonomous ships and has become a central point of research in the maritime safety domain. Effective and systematic collision avoidance strategies significantly reduce the risk of vessel collisions, ensuring safe navigation. This study develops a multi-vessel automatic collision avoidance decision-making method based on deep reinforcement learning (DRL) and establishes a vessel behavior decision model. When designing the reward function for continuous action spaces, the criteria of the "Convention on the International Regulations for Preventing Collisions at Sea" (COLREGs) were adhered to, taking into account the vessel's collision risk under various encounter situations, real-world navigation practices, and navigational complexities. Furthermore, to enable the algorithm to precisely differentiate between collision avoidance and the navigation resumption phase in varied vessel encounter situations, this paper incorporated "collision avoidance decision making" and "course recovery decision making" as state parameters in the state set design, from which the respective objective functions were defined. To further enhance the algorithm's performance, techniques such as behavior cloning, residual networks, and CPU-GPU dual-core parallel processing modules were integrated. Through simulation experiments in the enhanced Imazu training environment, the practicality of the method, taking into account the effects of wind and ocean currents, was corroborated. The results demonstrate that the proposed algorithm can perform effective collision avoidance decision making in a range of vessel encounter situations, indicating its efficiency and robust generalization capabilities.
引用
收藏
页数:29
相关论文
共 40 条
[1]   A study on the collision avoidance of a ship using neural networks and fuzzy logic [J].
Ahn, Jin-Hyeong ;
Rhee, Key-Pyo ;
You, Young-Jun .
APPLIED OCEAN RESEARCH, 2012, 37 :162-173
[2]   Navigation of unmanned marine vehicles in accordance with the rules of the road [J].
Benjamin, Michael R. ;
Curcio, Joseph A. ;
Leonard, John J. ;
Newman, Paul M. .
2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-10, 2006, :3581-+
[3]  
Cai Y., 2013, C P, P191
[4]   Decision making and strategies in an interaction situation: Collision avoidance at sea [J].
Chauvin, Christine ;
Lardjane, Salim .
TRANSPORTATION RESEARCH PART F-TRAFFIC PSYCHOLOGY AND BEHAVIOUR, 2008, 11 (04) :259-269
[5]  
Chen Y, 2007, PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-8, P2691
[6]  
Engstrom L, 2020, Arxiv, DOI arXiv:2005.12729
[7]   Estimation of vessel collision risk index based on support vector machine [J].
Gang, Longhui ;
Wang, Yonghui ;
Sun, Yao ;
Zhou, Liping ;
Zhang, Mingheng .
ADVANCES IN MECHANICAL ENGINEERING, 2016, 8 (11) :1-10
[8]   Online Prediction of Ship Behavior with Automatic Identification System Sensor Data Using Bidirectional Long Short-Term Memory Recurrent Neural Network [J].
Gao, Miao ;
Shi, Guoyou ;
Li, Shuang .
SENSORS, 2018, 18 (12)
[9]  
Geng H., 2018, Proceedings of ELM-2016
[10]  
Goodwin E.M., 1973, J. Navig, V26, P130, DOI [10.1017/S0373463300022992, DOI 10.1017/S0373463300022992]