Ship Collision Avoidance Using Constrained Deep Reinforcement Learning

Authors
Zhang, Rui [1]
Wang, Xiao [2]
Liu, Kezhong [3]
Wu, Xiaolie [4]
Lu, Tianyou [2]
Chao, Zhaohui [2]
Affiliations
[1] Wuhan Univ Technol, Sch Comp Sci & Technol, Hubei Key Lab Transportat Internet Things, Wuhan 434070, Hubei, Peoples R China
[2] Wuhan Univ Technol, Sch Comp Sci & Technol, Wuhan 434070, Hubei, Peoples R China
[3] Wuhan Univ Technol, Sch Nav, Hubei Key Lab Inland Shipping Technol, Wuhan 434070, Hubei, Peoples R China
[4] Wuhan Univ Technol, Sch Nav, Wuhan 434070, Hubei, Peoples R China
Source
2018 5TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC, AND SOCIO-CULTURAL COMPUTING (BESC), 2018
Funding
National Natural Science Foundation of China
Keywords
reinforcement learning; constraint; collision avoidance; Deep Q Network
DOI
10.1109/BESC.2018.00031
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
In recent years, the rapid development of mobile technology and application platforms has provided better services for life and work, and artificial intelligence and mobile technology have made traffic ever more convenient. As an artificial intelligence method that intersects with multiple disciplines and fields, reinforcement learning has proved highly effective in the automatic driving of vehicles. However, ship collision avoidance remains difficult because it involves continuous actions and complicated regulations. We find that constraining the states, actions, and regulations of reinforcement learning makes it applicable to ship collision avoidance even when the state and action spaces are vast. Hence, we propose Constrained-DQN (Deep Q Network), which limits the state and action sets and separates the reward value according to different regulations. Experiments show that Constrained-DQN is more stable and adaptive in handling continuous space than traditional path planning algorithms.
Pages: 115-120
Page count: 6