Optimizing Multi-Vessel Collision Avoidance Decision Making for Autonomous Surface Vessels: A COLREGs-Compliant Deep Reinforcement Learning Approach

被引:7
|
作者
Xie, Weidong [1 ]
Gang, Longhui [1 ]
Zhang, Mingheng [2 ]
Liu, Tong [1 ]
Lan, Zhixun [1 ]
机构
[1] Dalian Maritime Univ, Coll Nav, Dalian 116026, Peoples R China
[2] Dalian Univ Technol, Sch Mech Engn, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
automatic collision avoidance decision making; multi-ship encounter situations; deep reinforcement learning; COLREGs; MARINE VEHICLES; FUZZY-LOGIC; NAVIGATION; SYSTEM;
D O I
10.3390/jmse12030372
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Automatic collision avoidance decision making for vessels is a critical challenge in the development of autonomous ships and has become a central point of research in the maritime safety domain. Effective and systematic collision avoidance strategies significantly reduce the risk of vessel collisions, ensuring safe navigation. This study develops a multi-vessel automatic collision avoidance decision-making method based on deep reinforcement learning (DRL) and establishes a vessel behavior decision model. When designing the reward function for continuous action spaces, the criteria of the "Convention on the International Regulations for Preventing Collisions at Sea" (COLREGs) were adhered to, taking into account the vessel's collision risk under various encounter situations, real-world navigation practices, and navigational complexities. Furthermore, to enable the algorithm to precisely differentiate between collision avoidance and the navigation resumption phase in varied vessel encounter situations, this paper incorporated "collision avoidance decision making" and "course recovery decision making" as state parameters in the state set design, from which the respective objective functions were defined. To further enhance the algorithm's performance, techniques such as behavior cloning, residual networks, and CPU-GPU dual-core parallel processing modules were integrated. Through simulation experiments in the enhanced Imazu training environment, the practicality of the method, taking into account the effects of wind and ocean currents, was corroborated. The results demonstrate that the proposed algorithm can perform effective collision avoidance decision making in a range of vessel encounter situations, indicating its efficiency and robust generalization capabilities.
引用
收藏
页数:29
相关论文
共 37 条
  • [31] Decision-making of autonomous vehicles in interactions with jaywalkers: A risk-aware deep reinforcement learning approach
    Zhang, Ziqian
    Li, Haojie
    Chen, Tiantian
    Sze, N. N.
    Yang, Wenzhang
    Zhang, Yihao
    Ren, Gang
    ACCIDENT ANALYSIS AND PREVENTION, 2025, 210
  • [32] A Novel Ship Collision Avoidance Awareness Approach for Cooperating Ships Using Multi-Agent Deep Reinforcement Learning
    Chen, Chen
    Ma, Feng
    Xu, Xiaobin
    Chen, Yuwang
    Wang, Jin
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2021, 9 (10)
  • [33] A high-risk test scenario adaptive generation algorithm for ship autonomous collision avoidance decision-making based on Reinforcement Learning
    Zhu, Feixiang
    Niu, Yihan
    Wei, Moxuan
    Du, Yifan
    Zhai, Pengyu
    OCEAN ENGINEERING, 2025, 320
  • [34] Joint Optimization of Sensing, Decision-Making and Motion-Controlling for Autonomous Vehicles: A Deep Reinforcement Learning Approach
    Chen, Longquan
    He, Ying
    Wang, Qiang
    Pan, Weike
    Ming, Zhong
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (05) : 4642 - 4654
  • [35] Multi-objective Longitudinal Decision-making for Autonomous Electric Vehicle: A Entropy-constrained Reinforcement Learning Approach
    He, Xiangkun
    Fei, Cong
    Liu, Yulong
    Yang, Kaiming
    Ji, Xuewu
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [36] Multi-agent deep reinforcement learning-based autonomous decision-making framework for community virtual power plants
    Li, Xiangyu
    Luo, Fengji
    Li, Chaojie
    APPLIED ENERGY, 2024, 360
  • [37] An Improved Approach towards Multi-Agent Pursuit-Evasion Game Decision-Making Using Deep Reinforcement Learning
    Wan, Kaifang
    Wu, Dingwei
    Zhai, Yiwei
    Li, Bo
    Gao, Xiaoguang
    Hu, Zijian
    ENTROPY, 2021, 23 (11)