Knowledge transfer enabled reinforcement learning for efficient and safe autonomous ship collision avoidance

被引:2
|
作者
Wang, Chengbo [1 ]
Wang, Ning [2 ]
Gao, Hongbo [1 ,3 ,4 ]
Wang, Leihao [5 ]
Zhao, Yizhuo [1 ]
Fang, Mingxing [6 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Peoples R China
[2] Chongqing Coll Mobile Commun, Chongqing, Peoples R China
[3] Univ Sci & Technol China, Inst Adv Technol, Hefei, Peoples R China
[4] Nanyang Technol Univ, Singapore 639798, Singapore
[5] AVIC Leihua Elect Technol Res Inst, Wuxi, Peoples R China
[6] Anhui Normal Univ, Sch Phys & Elect Informat, Wuhu, Peoples R China
基金
中国国家自然科学基金;
关键词
Collision avoidance decision-making; Autonomous ship; Deep reinforcement learning; Knowledge transfer;
D O I
10.1007/s13042-024-02116-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research on collision avoidance decision-making (CADM) for autonomous ships is a very challenging task in the shipping field. Considered one of the machine learning algorithms that has received considerable attention, reinforcement learning technology enables actions to be continually optimized by agents interacting with the environment, aiming to maximize rewards and returns. Significant potential is attributed to the research on autonomous ship collision avoidance. To investigate an efficient and practical ship collision avoidance algorithm, the knowledge transfer (KT) method is employed in this research to introduce an improved reinforcement learning approach. With a thorough understanding of ship collision avoidance behavior and the Convention on the International Regulations for Preventing Collisions at Sea (COLREGs), a reward function is designed to guide and constrain ship collision avoidance behavior. Subsequently, ship collision avoidance tasks are categorized, and knowledge from source tasks is extracted and transferred to closely related target tasks. Experiments have been conducted across various collision avoidance tasks, encompassing diverse types and degrees of similarity. In multi-ships cases, the success rate of the learned knowledge applications of head-on, overtaking, and crossing encounter cases are 90%, 95%, and 82.5% respectively. The outcomes demonstrate that the proposed method enhances algorithmic efficiency while satisfying the requirements for safety and rule compliance in ship collision avoidance behavior. Furthermore, the methodology could also benefit other autonomous systems in dynamic environments.
引用
收藏
页码:3715 / 3731
页数:17
相关论文
共 50 条
  • [1] Deep reinforcement learning-based collision avoidance for an autonomous ship
    Chun, Do-Hyun
    Roh, Myung-Il
    Lee, Hye-Won
    Ha, Jisang
    Yu, Donghun
    OCEAN ENGINEERING, 2021, 234
  • [2] TRANSFER REINFORCEMENT LEARNING: FEATURE TRANSFERABILITY IN SHIP COLLISION AVOIDANCE
    Wang, Xinrui
    Jin, Yan
    PROCEEDINGS OF ASME 2023 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2023, VOL 3B, 2023,
  • [3] Safe Reinforcement Learning for Pedestrian Collision Avoidance in Connected and Autonomous Vehicles
    He, Ying
    Zou, Guangyuan
    Zhou, Guang
    Pan, Weike
    Ming, Zhong
    AD HOC & SENSOR WIRELESS NETWORKS, 2025, 60 (1-2) : 141 - 169
  • [4] Research on autonomous collision avoidance of merchant ship based on inverse reinforcement learning
    Zheng, Mao
    Xie, Shuo
    Chu, Xiumin
    Zhu, Tianquan
    Tian, Guohao
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (06)
  • [5] Collision avoidance for autonomous ship using deep reinforcement learning and prior-knowledge-based approximate representation
    Wang, Chengbo
    Zhang, Xinyu
    Yang, Zaili
    Bashir, Musa
    Lee, Kwangil
    FRONTIERS IN MARINE SCIENCE, 2023, 9
  • [6] Safe and efficient collision avoidance control for autonomous vehicles
    Wang, Qiang
    Li, Dachuan
    Sifakis, Joseph
    2020 18TH ACM-IEEE INTERNATIONAL CONFERENCE ON FORMAL METHODS AND MODELS FOR SYSTEM DESIGN (MEMOCODE), 2020, : 155 - 160
  • [7] CONTROL METHOD FOR PATH FOLLOWING AND COLLISION AVOIDANCE OF AUTONOMOUS SHIP BASED ON DEEP REINFORCEMENT LEARNING
    Zhao, Luman
    Roh, Myung-Il
    Lee, Sung-Jun
    JOURNAL OF MARINE SCIENCE AND TECHNOLOGY-TAIWAN, 2019, 27 (04): : 293 - 310
  • [8] Deep Reinforcement Learning for Collision Avoidance of Autonomous Vehicle
    Tseng, Hsiao-Ting
    Hsieh, Chen-Chiung
    Lin, Wei-Ting
    Lin, Jyun-Ting
    2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
  • [9] WORK PROCESS TRANSFER REINFORCEMENT LEARNING: FEATURE EXTRACTION AND FINETUNING IN SHIP COLLISION AVOIDANCE
    Wang, Xinrui
    Jin, Yan
    PROCEEDINGS OF ASME 2022 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2022, VOL 2, 2022,
  • [10] Autonomous spacecraft collision avoidance with a variable number of space debris based on safe reinforcement learning
    Mu, Chaoxu
    Liu, Shuo
    Lu, Ming
    Liu, Zhaoyang
    Cui, Lei
    Wang, Ke
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 149