Deep hierarchical reinforcement learning based formation planning for multiple unmanned surface vehicles with experimental results

被引:25
作者
Wei, Xiangwei [1 ]
Wang, Hao [1 ]
Tang, Yixuan [1 ]
机构
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; Hierarchical reinforcement learning; Artificial potential field; Formation control; Unmanned surface vehicles; CONTROLLER;
D O I
10.1016/j.oceaneng.2023.115577
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
In this paper, a novel multi-USV formation path planning algorithm is proposed based on deep reinforcement learning. First, a goal-based hierarchical reinforcement learning algorithm is designed to improve training speed and resolve planning conflicts within the formation. Second, an improved artificial potential field algorithm is designed in the training process to obtain the optimal path planning and obstacle avoidance learning scheme for multi-USVs in the determined perceptual environment. Finally, a formation geometry model is established to describe the physical relationships among USVs, and a composite reward function is proposed to guide the training. Numerous simulation tests are conducted, and the effectiveness of the proposed algorithm are further validated through the NEU-MSV01 experimental platform with a combination of parameterized Line of Sight (LOS) guidance.
引用
收藏
页数:9
相关论文
共 25 条
[1]  
Gonzalez-Garcia A., 2020, P 2020 4 INT C DEEP, P118
[2]   USV Path-Following Control Based On Deep Reinforcement Learning and Adaptive Control [J].
Gonzalez-Garcia, Alejandro ;
Castaneda, Herman ;
Garrido, Leonardo .
GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
[3]  
Gu N, 2017, CHIN CONTR CONF, P6632, DOI 10.23919/ChiCC.2017.8028408
[4]   Adaptive tracking control of unmanned underwater vehicles with compensation for external perturbations and uncertainties using Port-Hamiltonian theory [J].
Jia, Zehua ;
Qiao, Lei ;
Zhang, Weidong .
OCEAN ENGINEERING, 2020, 209
[5]   The angle guidance path planning algorithms for unmanned surface vehicle formations by using the fast marching method [J].
Liu, Yuanchang ;
Bucknall, Richard .
APPLIED OCEAN RESEARCH, 2016, 59 :327-344
[6]   Path planning algorithm for unmanned surface vehicle formations in a practical maritime environment [J].
Liu, Yuanchang ;
Bucknall, Richard .
OCEAN ENGINEERING, 2015, 97 :126-144
[7]   Fault Detection Filter and Controller Co-Design for Unmanned Surface Vehicles Under DoS Attacks [J].
Ma, Yong ;
Nie, Zongqiang ;
Hu, Songlin ;
Li, Zhixiong ;
Malekian, Reza ;
Sotelo, M. .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (03) :1422-1434
[8]   Neuro-adaptive distributed formation tracking control of under-actuated unmanned surface vehicles with input quantization [J].
Ning, Jun ;
Li, Tieshan ;
Chen, C. L. Philip .
OCEAN ENGINEERING, 2022, 265
[9]   Data-driven distributed formation control of under-actuated unmanned surface vehicles with collision avoidance via model-based deep reinforcement learning [J].
Pan, Chao ;
Peng, Zhouhua ;
Liu, Lu ;
Wang, Dan .
OCEAN ENGINEERING, 2023, 267
[10]  
Peng Z., 2023, IEEE Trans. Veh. Technol.