An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning

被引:23
|
作者
Xing, Bowen [1 ]
Wang, Xiao [1 ,2 ]
Yang, Liu [1 ]
Liu, Zhenchong [3 ]
Wu, Qingyun [1 ]
机构
[1] Shanghai Ocean Univ, Coll Engn Sci & Technol, Shanghai 201306, Peoples R China
[2] Shanghai Invest Design & Res Inst, Shanghai 200335, Peoples R China
[3] Shanghai Zhongchuan NERC SDT Co Ltd, Shanghai 201114, Peoples R China
关键词
environment modeling; raster map; screening matrix; DQN; reward function;
D O I
10.3390/jmse11030645
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
A deep reinforcement learning method to achieve complete coverage path planning for an unmanned surface vehicle (USV) is proposed. This paper firstly models the USV and the workspace required for complete coverage. Then, for the full-coverage path planning task, this paper proposes a preprocessing method for raster maps, which can effectively delete the blank areas that are impossible to cover in the raster map. In this paper, the state matrix corresponding to the preprocessed raster map is used as the input of the deep neural network. The deep Q network (DQN) is used to train the complete coverage path planning strategy of the agent. The improvement of the selection of random actions during training is first proposed. Considering the task of complete coverage path planning, this paper replaces random actions with a set of actions toward the nearest uncovered grid. To solve the problem of the slow convergence speed of the deep reinforcement learning network in full-coverage path planning, this paper proposes an improved method of deep reinforcement learning, which superimposes the final output layer with a dangerous actions matrix to reduce the risk of selection of dangerous actions of USVs during the learning process. Finally, the designed method validates via simulation examples.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] A coverage path planning approach for environmental monitoring using an unmanned surface vehicle
    Ramkumar Sudha S.K.
    Mishra D.
    Hameed I.A.
    Ocean Engineering, 2024, 310
  • [32] Optimal search path planning for unmanned surface vehicle based on an improved genetic algorithm
    Guo, Hui
    Mao, Zhaoyong
    Ding, Wenjun
    Liu, Peiliang
    COMPUTERS & ELECTRICAL ENGINEERING, 2019, 79
  • [33] Path Planning for Unmanned Surface Vehicle based on genetic algorithm and sequential quadratic programming
    Zhuang, Yufei
    Wang, Cheng
    Huang, Haibin
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 3513 - 3518
  • [34] Global Path Planning of Unmanned Surface Vehicle Based on Improved A-Star Algorithm
    Zhang, Huixia
    Tao, Yadong
    Zhu, Wenliang
    SENSORS, 2023, 23 (14)
  • [35] Global path planning of unmanned vehicle based on improved A* algorithm
    Liang, Hao
    Du, Xiaofang
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ALGORITHMS, SOFTWARE ENGINEERING, AND NETWORK SECURITY, ASENS 2024, 2024, : 176 - 184
  • [36] Unmanned aerial vehicle path planning based on TLBO algorithm
    Yu, Guolin (guolin_yu@126.com), 1600, Massey University (07):
  • [37] Unmanned aircraft vehicle path planning based on SVM algorithm
    Chen, Yanhong
    Zu, Wei
    Fan, Guoliang
    Chang, Hongxing
    Advances in Intelligent Systems and Computing, 2014, 215 : 705 - 714
  • [38] UNMANNED AERIAL VEHICLE PATH PLANNING BASED ON TLBO ALGORITHM
    Yu, Guolin
    Song, Hui
    Gao, Jie
    INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2014, 7 (03) : 1310 - 1325
  • [39] Application of improved RRT algorithm in unmanned surface vehicle path planning
    Lin, Yutong
    Zhang, Wenjun
    Mu, Congrui
    Wang, Jianhui
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 4861 - 4865
  • [40] An Improved Genetic Algorithm for Path-Planning of Unmanned Surface Vehicle
    Xin, Junfeng
    Zhong, Jiabao
    Yang, Fengru
    Cui, Ying
    Sheng, Jinlu
    SENSORS, 2019, 19 (11)