An Algorithm of Complete Coverage Path Planning for Unmanned Surface Vehicle Based on Reinforcement Learning

Cited by: 23

Authors:

Xing, Bowen [1]

Wang, Xiao [1,2]

Yang, Liu [1]

Liu, Zhenchong [3]

Wu, Qingyun [1]

Affiliations:

[1] Shanghai Ocean Univ, Coll Engn Sci & Technol, Shanghai 201306, Peoples R China

[2] Shanghai Invest Design & Res Inst, Shanghai 200335, Peoples R China

[3] Shanghai Zhongchuan NERC SDT Co Ltd, Shanghai 201114, Peoples R China

Source:

JOURNAL OF MARINE SCIENCE AND ENGINEERING | 2023 / Vol. 11 / Issue 03

Keywords:

environment modeling; raster map; screening matrix; DQN; reward function;

DOI:

10.3390/jmse11030645

Chinese Library Classification number:

U6 [Waterway transportation]; P75 [Ocean engineering];

Discipline classification code:

0814 ; 081505 ; 0824 ; 082401 ;

Abstract:

A deep reinforcement learning method for complete coverage path planning of an unmanned surface vehicle (USV) is proposed. The paper first models the USV and the workspace required for complete coverage. For the full-coverage path planning task, it then proposes a preprocessing method for raster maps that effectively deletes the blank areas of the map that are impossible to cover. The state matrix corresponding to the preprocessed raster map is used as the input of the deep neural network, and a deep Q network (DQN) is used to train the agent's complete coverage path planning strategy. An improvement to the selection of random actions during training is proposed first: to suit the complete coverage task, random actions are replaced with a set of actions toward the nearest uncovered grid. To address the slow convergence of the deep reinforcement learning network in full-coverage path planning, the paper also proposes an improved deep reinforcement learning method that superimposes a dangerous actions matrix on the final output layer, reducing the risk that the USV selects dangerous actions during learning. Finally, the designed method is validated through simulation examples.
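The abstract names two changes to action selection without giving their details, so the following is only an illustrative sketch and not the authors' implementation. It assumes a four-neighbor grid, hypothetical cell codes (0 = uncovered, 1 = covered, -1 = obstacle), breadth-first search as the "nearest uncovered grid" measure, and a dangerous actions matrix realized as a vector of 0 and negative infinity added to the Q-values. All function names are invented for the example.

```python
import random
from collections import deque

# Assumed cell codes for this sketch: 0 = uncovered, 1 = covered, -1 = obstacle.
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
NEG_INF = float("-inf")
INF = float("inf")


def is_free(grid, r, c):
    """True if (r, c) lies inside the map and is not an obstacle."""
    return 0 <= r < len(grid) and 0 <= c < len(grid[0]) and grid[r][c] != -1


def danger_mask(grid, pos):
    """Per-action vector: 0 for a safe move, -inf for a move off the map or into an obstacle."""
    return [0.0 if is_free(grid, pos[0] + dr, pos[1] + dc) else NEG_INF
            for dr, dc in ACTIONS]


def dist_to_uncovered(grid, start):
    """BFS distance from start to the nearest uncovered cell (inf if none is reachable)."""
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        (r, c), d = queue.popleft()
        if grid[r][c] == 0:
            return d
        for dr, dc in ACTIONS:
            nxt = (r + dr, c + dc)
            if is_free(grid, *nxt) and nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, d + 1))
    return INF


def guided_actions(grid, pos):
    """Safe actions whose next cell is closest to an uncovered cell."""
    mask = danger_mask(grid, pos)
    dists = [dist_to_uncovered(grid, (pos[0] + dr, pos[1] + dc)) if m == 0.0 else INF
             for (dr, dc), m in zip(ACTIONS, mask)]
    best = min(dists)
    if best == INF:
        # Nothing left to cover (or nothing reachable): fall back to any safe action.
        safe = [i for i, m in enumerate(mask) if m == 0.0]
        return safe or list(range(len(ACTIONS)))
    return [i for i, d in enumerate(dists) if d == best]


def select_action(q_values, grid, pos, epsilon, rng=random):
    """Epsilon-greedy selection with guided exploration and a masked greedy step."""
    if rng.random() < epsilon:
        # Exploration: sample from the guided set instead of all actions.
        return rng.choice(guided_actions(grid, pos))
    # Exploitation: superimpose the mask on the network output, then take the argmax.
    masked = [q + m for q, m in zip(q_values, danger_mask(grid, pos))]
    return max(range(len(masked)), key=masked.__getitem__)
```

In this sketch, `q_values` stands in for the output of the DQN's final layer for the current state. Adding negative infinity before the argmax means a dangerous action can never be chosen greedily, which is one common way to realize such a mask; the paper's matrix may use different values.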


Pages: 19

