Knowledge transfer enabled reinforcement learning for efficient and safe autonomous ship collision avoidance

Cited by: 2

Authors:

Wang, Chengbo ^{[1]}

Wang, Ning ^{[2]}

Gao, Hongbo ^{[1,3,4]}

Wang, Leihao ^{[5]}

Zhao, Yizhuo ^{[1]}

Fang, Mingxing ^{[6]}

Affiliations:

[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Peoples R China

[2] Chongqing Coll Mobile Commun, Chongqing, Peoples R China

[3] Univ Sci & Technol China, Inst Adv Technol, Hefei, Peoples R China

[4] Nanyang Technol Univ, Singapore 639798, Singapore

[5] AVIC Leihua Elect Technol Res Inst, Wuxi, Peoples R China

[6] Anhui Normal Univ, Sch Phys & Elect Informat, Wuhu, Peoples R China

Source:

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS | 2024 / Vol. 15 / No. 09

Funding:

National Natural Science Foundation of China;

Keywords:

Collision avoidance decision-making; Autonomous ship; Deep reinforcement learning; Knowledge transfer;

DOI:

10.1007/s13042-024-02116-4

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Collision avoidance decision-making (CADM) for autonomous ships is a challenging research problem in the shipping field. Reinforcement learning, a machine learning approach that has received considerable attention, lets agents continually optimize their actions by interacting with the environment so as to maximize rewards and returns, and it holds significant potential for autonomous ship collision avoidance research. To investigate an efficient and practical ship collision avoidance algorithm, this research employs the knowledge transfer (KT) method to introduce an improved reinforcement learning approach. Based on a thorough understanding of ship collision avoidance behavior and the Convention on the International Regulations for Preventing Collisions at Sea (COLREGs), a reward function is designed to guide and constrain ship collision avoidance behavior. Subsequently, ship collision avoidance tasks are categorized, and knowledge from source tasks is extracted and transferred to closely related target tasks. Experiments were conducted across various collision avoidance tasks, encompassing diverse types and degrees of similarity. In multi-ship cases, the success rates of applying the learned knowledge to head-on, overtaking, and crossing encounters are 90%, 95%, and 82.5%, respectively. The outcomes demonstrate that the proposed method enhances algorithmic efficiency while satisfying the requirements for safety and rule compliance in ship collision avoidance behavior. Furthermore, the methodology could also benefit other autonomous systems in dynamic environments.

Pages: 3715-3731

Page count: 17
