An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning

Cited by: 23
Authors
Xing, Bowen [1]
Wang, Xiao [1,2]
Yang, Liu [1]
Liu, Zhenchong [3]
Wu, Qingyun [1]
Affiliations
[1] Shanghai Ocean Univ, Coll Engn Sci & Technol, Shanghai 201306, Peoples R China
[2] Shanghai Invest Design & Res Inst, Shanghai 200335, Peoples R China
[3] Shanghai Zhongchuan NERC SDT Co Ltd, Shanghai 201114, Peoples R China
Keywords
environment modeling; raster map; screening matrix; DQN; reward function
DOI
10.3390/jmse11030645
Chinese Library Classification (CLC) number
U6 [Waterway Transportation]; P75 [Ocean Engineering]
Subject classification code
0814; 081505; 0824; 082401
Abstract
This paper proposes a deep reinforcement learning method for complete coverage path planning of an unmanned surface vehicle (USV). The USV and the workspace required for complete coverage are modeled first. A preprocessing method for raster maps is then proposed that removes the blank areas of the map that cannot be covered. The state matrix corresponding to the preprocessed raster map serves as the input to the deep neural network, and a deep Q network (DQN) is used to train the agent's complete coverage path planning strategy. Two improvements to training are proposed. First, random actions are replaced with a set of actions directed toward the nearest uncovered grid cell. Second, to address the slow convergence of the deep reinforcement learning network in complete coverage path planning, a dangerous-actions matrix is superimposed on the final output layer, which reduces the risk that the USV selects dangerous actions during learning. Finally, the method is validated through simulation examples.
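The two action-selection ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the cell codes, the four-move action set, the large negative penalty used as the "dangerous actions" vector, the breadth-first search used to find the nearest uncovered cell, and all function names are assumptions made for illustration only. The Q-values are passed in as a plain array, standing in for the output layer of a trained DQN.

```python
from collections import deque
import numpy as np

# Assumed cell encoding and action set (not taken from the paper).
FREE, OBSTACLE, COVERED = 0, 1, 2
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
PENALTY = -1e9  # assumed value added to the Q-value of a dangerous action

def danger_mask(grid, pos):
    """PENALTY for moves that leave the map or hit an obstacle, 0 otherwise."""
    rows, cols = grid.shape
    mask = np.zeros(len(ACTIONS))
    for a, (dr, dc) in enumerate(ACTIONS):
        r, c = pos[0] + dr, pos[1] + dc
        if not (0 <= r < rows and 0 <= c < cols) or grid[r, c] == OBSTACLE:
            mask[a] = PENALTY
    return mask

def dist_to_uncovered(grid, start):
    """Breadth-first distance from start to the nearest FREE (uncovered) cell."""
    rows, cols = grid.shape
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        (r, c), d = queue.popleft()
        if grid[r, c] == FREE:
            return d
        for dr, dc in ACTIONS:
            nxt = (r + dr, c + dc)
            inside = 0 <= nxt[0] < rows and 0 <= nxt[1] < cols
            if inside and grid[nxt] != OBSTACLE and nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, d + 1))
    return float("inf")

def guided_actions(grid, pos):
    """Safe actions whose target cell is closest to an uncovered cell."""
    mask = danger_mask(grid, pos)
    dists = {
        a: dist_to_uncovered(grid, (pos[0] + dr, pos[1] + dc))
        for a, (dr, dc) in enumerate(ACTIONS)
        if mask[a] == 0
    }
    if not dists or min(dists.values()) == float("inf"):
        return []
    best = min(dists.values())
    return [a for a, d in dists.items() if d == best]

def select_action(q_values, grid, pos, epsilon, rng):
    """Epsilon-greedy choice with guided exploration and danger masking."""
    if rng.random() < epsilon:
        candidates = guided_actions(grid, pos)
        if candidates:
            # Explore toward the nearest uncovered cell instead of uniformly.
            return int(rng.choice(candidates))
    # Exploit: superimpose the danger vector on the network output.
    return int(np.argmax(np.asarray(q_values) + danger_mask(grid, pos)))
```

In this sketch the exploratory branch never proposes a move into an obstacle or off the map, and the greedy branch cannot pick such a move unless every action is dangerous. How the paper builds its dangerous actions matrix and its set of guided actions may differ from the choices made here.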
Pages: 19
Related papers
50 in total
  • [41] Application of Improved Genetic Algorithm to Unmanned Surface Vehicle Path Planning
    Long, Yang
    Su, Yixin
    Zhang, Huajun
    Li, Ming
    PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 209 - 212
  • [42] Hybrid bacterial foraging algorithm for unmanned surface vehicle path planning
    Long Y.
    Su Y.
    Lian C.
    Zhang D.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (03): : 68 - 73
  • [43] Reinforcement learning-based complete area coverage path planning for a modified htrihex robot
    Apuroop, Koppaka Ganesh Sai
    Le, Anh Vu
    Elara, Mohan Rajesh
    Sheu, Bing J.
    Sensors (Switzerland), 2021, 21 (04): : 1 - 20
  • [44] Reinforcement Learning-Based Complete Area Coverage Path Planning for a Modified hTrihex Robot
    Apuroop, Koppaka Ganesh Sai
    Le, Anh Vu
    Elara, Mohan Rajesh
    Sheu, Bing J.
    SENSORS, 2021, 21 (04) : 1 - 20
  • [45] Complete coverage path planning using reinforcement learning for Tetromino based cleaning and maintenance robot
    Lakshmanan, Anirudh Krishna
    Elara, Mohan Rajesh
    Ramalingam, Balakrishnan
    Anh Vu Le
    Veerajagadeshwar, Prabahar
    Tiwari, Kamlesh
    Ilyas, Muhammad
    AUTOMATION IN CONSTRUCTION, 2020, 112
  • [46] Biologically Inspired Complete Coverage Path Planning Algorithm Based on Q-Learning
    Tan, Xiangquan
    Han, Linhui
    Gong, Hao
    Wu, Qingwen
    SENSORS, 2023, 23 (10)
  • [47] Research on Collision Avoidance Algorithm of Unmanned Surface Vehicle Based on Deep Reinforcement Learning
    Xia, Jiawei
    Zhu, Xufang
    Liu, Zhikun
    Luo, Yasong
    Wu, Zhaodong
    Wu, Qiuhan
    IEEE SENSORS JOURNAL, 2023, 23 (11) : 11262 - 11273
  • [48] Risk-Aware Complete Coverage Path Planning Using Reinforcement Learning
    Wijegunawardana, I. D.
    Samarakoon, S. M. Bhagya P.
    Muthugala, M. A. Viraj J.
    Elara, Mohan Rajesh
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (04): : 2476 - 2488
  • [49] Research on Path Tracking Control Method of Unmanned Surface Vehicle Based on Deep Reinforcement Learning
    Guo, Rui
    Yuan, Wei
    INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021, 2021, 11884
  • [50] Complete Coverage Path Planning Based on Ant Colony Algorithm
    Zhang Chibin
    Wang Xingsong
    Du Yong
    2008 15TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND MACHINE VISION IN PRACTICE (M2VIP), 2008, : 346 - 350