Deep reinforcement learning-based collision avoidance for an autonomous ship

被引:104
|
作者
Chun, Do-Hyun [1 ]
Roh, Myung-Il [1 ,2 ]
Lee, Hye-Won [3 ]
Ha, Jisang [1 ]
Yu, Donghun [1 ]
机构
[1] Seoul Natl Univ, Dept Naval Architecture & Ocean Engn, 1 Gwanak Ro, Seoul 08826, South Korea
[2] Seoul Natl Univ, Res Inst Marine Syst Engn, 1 Gwanak Ro, Seoul 08826, South Korea
[3] Seoul Natl Univ, Res Inst Marine Syst Engn, Seoul, South Korea
关键词
Collision avoidance; Autonomous ship; Collision  risk; COLREGs; Deep  reinforcement learning; SIMULATION; BEHAVIOR;
D O I
10.1016/j.oceaneng.2021.109216
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Social interest in autonomous navigation systems for autonomous ships is also increasing. For a robust autonomous navigation system, the location, speed, and direction of the ship and other ships must be identified in real time, and collision avoidance should be performed at an appropriate time by considering the collision risk. In this study, we proposed a collision avoidance method that quantitatively assesses the collision risk and then generates an avoidance path. First, to assess the collision risk, a collision risk assessment method based on the ship domain and the closest point of approach (CPA) was proposed. The ship domain is created with an asymmetric shape considering manoeuvring performance and the COLREGs. The CPA is used to assess quantitative collision risk value. Subsequently, a path generation algorithm based on deep reinforcement learning (DRL) was proposed to determine the avoidance time and to generate an avoidance path complying the COLREGs for the most dangerous ship in terms of collision risk. The information of own ship and target ship such as location, speed, heading, collision risk is used as the input state, and the rudder angle of own ship is set as the output action of the DRL. The cost function related to the path following and the collision avoidance is defined as the reward of the DRLbased collision avoidance method. Additionally, the DRL modes are defined to navigate the flexible avoidance path by changing the ratio between the path following and the collision avoidance. To verify the proposed method, we compared the collision avoidance method with the A* algorithm, which is a traditional path planning algorithm, and analyzed the results for various scenarios. The proposed method reliably avoided collisions through flexible paths for complex and unexpected changes in situations compared to the A* algorithm.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] TOWARDS THE DEEP LEARNING-BASED AUTONOMOUS COLLISION AVOIDANCE
    He, Binxin
    Xiao, Youan
    Wang, Tengfei
    Li, Zhuo
    PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 9, 2022,
  • [2] CONTROL METHOD FOR PATH FOLLOWING AND COLLISION AVOIDANCE OF AUTONOMOUS SHIP BASED ON DEEP REINFORCEMENT LEARNING
    Zhao, Luman
    Roh, Myung-Il
    Lee, Sung-Jun
    JOURNAL OF MARINE SCIENCE AND TECHNOLOGY-TAIWAN, 2019, 27 (04): : 293 - 310
  • [3] Method for collision avoidance based on deep reinforcement learning with path-speed control for an autonomous ship
    Chun, Do-Hyun
    Roh, Myung-Il
    Lee, Hye-Won
    Yu, Donghun
    INTERNATIONAL JOURNAL OF NAVAL ARCHITECTURE AND OCEAN ENGINEERING, 2024, 16
  • [4] Research on autonomous collision avoidance of merchant ship based on inverse reinforcement learning
    Zheng, Mao
    Xie, Shuo
    Chu, Xiumin
    Zhu, Tianquan
    Tian, Guohao
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (06)
  • [5] Deep Reinforcement Learning for Collision Avoidance of Autonomous Vehicle
    Tseng, Hsiao-Ting
    Hsieh, Chen-Chiung
    Lin, Wei-Ting
    Lin, Jyun-Ting
    2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
  • [6] Deep reinforcement learning based collision avoidance system for autonomous ships
    Wang, Yong
    Xu, Haixiang
    Feng, Hui
    He, Jianhua
    Yang, Haojie
    Li, Fen
    Yang, Zhen
    OCEAN ENGINEERING, 2024, 292
  • [7] Collision avoidance for autonomous ship using deep reinforcement learning and prior-knowledge-based approximate representation
    Wang, Chengbo
    Zhang, Xinyu
    Yang, Zaili
    Bashir, Musa
    Lee, Kwangil
    FRONTIERS IN MARINE SCIENCE, 2023, 9
  • [8] A human-like collision avoidance method for autonomous ship with attention-based deep reinforcement learning
    Jiang, Lingling
    An, Lanxuan
    Zhang, Xinyu
    Wang, Chengbo
    Wang, Xinjian
    OCEAN ENGINEERING, 2022, 264
  • [9] DEEP REINFORCEMENT LEARNING FOR SHIP COLLISION AVOIDANCE AND PATH TRACKING
    Singht, Amar Nath
    Vijayakumar, Akash
    Balasubramaniyam, Shankruth
    Somayajula, Abhilash
    PROCEEDINGS OF ASME 2024 43RD INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARCTIC ENGINEERING, OMAE2024, VOL 5B, 2024,
  • [10] Ship Collision Avoidance Using Constrained Deep Reinforcement Learning
    Zhang, Rui
    Wang, Xiao
    Liu, Kezhong
    Wu, Xiaolie
    Lu, Tianyou
    Chao Zhaohui
    2018 5TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC, AND SOCIO-CULTURAL COMPUTING (BESC), 2018, : 115 - 120