Dynamic collision avoidance for maritime autonomous surface ships based on deep Q-network with velocity obstacle method

Cited by: 1

Authors:

Li, Yuqin [1]

Wu, Defeng [1,2,3]

Wang, Hongdong [4]

Lou, Jiankun [4]

Affiliations:

[1] Jimei Univ, Sch Marine Engn, Xiamen 361021, Peoples R China

[2] Fujian Prov Key Lab Naval Architecture & Ocean Eng, Xiamen 361021, Peoples R China

[3] Fujian Inst Innovat Marine Equipment Detect & Remf, Huzhou, Peoples R China

[4] Shanghai Jiao Tong Univ, Key Lab Marine Intelligent Equipment & Syst, Minist Educ, Shanghai 200240, Peoples R China

Source:

OCEAN ENGINEERING | 2025 / Vol. 320

Funding:

National Natural Science Foundation of China;

Keywords:

Collision avoidance; COLREGs; Deep Q-network algorithm; Random collision scenario generation; Velocity Obstacle algorithm;

DOI:

10.1016/j.oceaneng.2025.120335

Chinese Library Classification (CLC) codes:

U6 [Waterway Transportation]; P75 [Ocean Engineering];

Discipline classification codes:

0814 ; 081505 ; 0824 ; 082401 ;

Abstract:

To address the challenge of collision avoidance for maritime autonomous surface ships (MASS) in dynamic obstacle environments, a decision-making method based on the deep Q-network (DQN) and the velocity obstacle (VO) algorithm is proposed. First, the encounter situation identification criteria are optimized, and a method for random collision scenario generation is designed. The model's performance is evaluated on a wide variety of randomly generated collision scenarios, which provide a broader assessment than manually set scenarios. Furthermore, a complete reward function for the dynamic collision avoidance problem is proposed, which combines ship collision risk, the velocity obstacle method, and the International Regulations for Preventing Collisions at Sea (COLREGs). This reward function not only guides the MASS towards the target but also ensures compliance with COLREGs during the collision avoidance process. Notably, the trained model does not require retraining when faced with different numbers of target ships (TS). Simulation experiments are conducted with the trained model, involving random encounters with 1 to 10 TS in open waters. The results indicate that the proposed method achieves better collision avoidance performance than the DQN and proximal policy optimization algorithms.
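The abstract names the velocity obstacle (VO) method but gives no formulas. As a rough illustration only, the sketch below shows the textbook VO test for two ships holding constant velocities: the own ship's velocity lies inside the velocity obstacle when the relative velocity points into the collision cone around the target ship. The function name, the argument layout, and the single circular safety radius are assumptions made for this example, not the authors' implementation.

```python
import math


def in_velocity_obstacle(own_pos, own_vel, ts_pos, ts_vel, safe_radius):
    """Return True if own_vel lies inside the velocity obstacle of one target ship.

    Hypothetical helper: positions and velocities are (x, y) tuples in
    consistent units, and safe_radius is an assumed combined safety radius.
    """
    # Position of the target ship relative to the own ship.
    dx = ts_pos[0] - own_pos[0]
    dy = ts_pos[1] - own_pos[1]
    # Velocity of the own ship relative to the target ship.
    rvx = own_vel[0] - ts_vel[0]
    rvy = own_vel[1] - ts_vel[1]

    dist = math.hypot(dx, dy)
    if dist <= safe_radius:
        return True  # safety circles already overlap

    rel_speed = math.hypot(rvx, rvy)
    if rel_speed == 0.0:
        return False  # no relative motion, so the separation never changes

    # The ships are closing only if the relative velocity has a positive
    # component along the line of sight to the target ship.
    if dx * rvx + dy * rvy <= 0.0:
        return False

    # Perpendicular miss distance of the relative-velocity ray from the target.
    miss = abs(dx * rvy - dy * rvx) / rel_speed
    return miss < safe_radius


# Head-on encounter: the relative velocity points straight at the target ship.
head_on = in_velocity_obstacle((0, 0), (0, 5), (0, 1000), (0, -5), 100)
# Parallel passing with a 500-unit lateral offset: outside the collision cone.
passing = in_velocity_obstacle((0, 0), (0, 5), (500, 1000), (0, -5), 100)
```

A learning-based method could use a test of this kind as one term of a reward signal, for instance by penalizing actions whose resulting velocity falls inside the cone; how the paper actually weights VO against collision risk and COLREGs is not stated in the abstract.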

Pages: 14

References (35 in total):

[1] USV collision hazard assessment and track planning algorithm [J]. Chen, Yan-Li; Du, Wei-Kang; Hu, Xin-Yu; Bai, Gui-Qiang; Zhang, Jia-Bao. OCEAN ENGINEERING, 2022, 261

[2] Deep reinforcement learning-based collision avoidance for an autonomous ship [J]. Chun, Do-Hyun; Roh, Myung-Il; Lee, Hye-Won; Ha, Jisang; Yu, Donghun. OCEAN ENGINEERING, 2021, 234

[3] Collision avoidance decision-making strategy for multiple USVs based on Deep Reinforcement Learning algorithm [J]. Cui, Zhewen; Guan, Wei; Zhang, Xianku. OCEAN ENGINEERING, 2024, 308

[4] Intelligent navigation method for multiple marine autonomous surface ships based on improved PPO algorithm [J]. Cui, Zhewen; Guan, Wei; Luo, Wenzhe; Zhang, Xianku. OCEAN ENGINEERING, 2023, 287

[5] European Maritime Safety Agency, 2023, More than 10k ships involved in navigational incidents in the last decade

[6] A novel intelligent collision avoidance algorithm based on deep reinforcement learning approach for USV [J]. Fan, Yunsheng; Sun, Zhe; Wang, Guofeng. OCEAN ENGINEERING, 2023, 287

[7] Intelligent decision-making system for multiple marine autonomous surface ships based on deep reinforcement learning [J]. Guan, Wei; Luo, Wenzhe; Cui, Zhewen. ROBOTICS AND AUTONOMOUS SYSTEMS, 2024, 172

[8] Hong ZW, 2018, ADV NEUR IN, V31

[9] A human-like collision avoidance method for autonomous ship with attention-based deep reinforcement learning [J]. Jiang, Lingling; An, Lanxuan; Zhang, Xinyu; Wang, Chengbo; Wang, Xinjian. OCEAN ENGINEERING, 2022, 264

[10] Ship Collision Avoidance and COLREGS Compliance Using Simulation-Based Control Behavior Selection With Predictive Hazard Assessment [J]. Johansen, Tor Arne; Perez, Tristan; Cristofaro, Andrea. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 17 (12): 3407-3422
