Dynamic Target Assignment by Unmanned Surface Vehicles Based on Reinforcement Learning

被引：1

作者：

Hu, Tao ^{[1
]}

Zhang, Xiaoxue ^{[1
]}

Luo, Xueshan ^{[1
]}

Chen, Tao ^{[1
]}

机构：

[1] Natl Univ Def Technol, Natl Key Lab Informat Syst Engn, Changsha 410073, Peoples R China

来源：

MATHEMATICS | 2024年 / 12卷 / 16期

基金：

中国国家自然科学基金;

关键词：

moving targets; weapon-target assignment; unmanned surface vessels; reinforcement learning; multi agent; LARGE NEIGHBORHOOD SEARCH; MISSILE DEFENSE; TASK ASSIGNMENT; OPTIMIZATION; ALLOCATION; ALGORITHM; HYBRID;

D O I：

10.3390/math12162557

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

Due to the dynamic complexities of the multi-unmanned vessel target assignment problem at sea, especially when addressing moving targets, traditional optimization algorithms often fail to quickly find an adequate solution. To overcome this, we have developed a multi-agent reinforcement learning algorithm. This approach involves defining a state space, employing preferential experience replay, and integrating self-attention mechanisms, which are applied to a novel offshore unmanned vessel model designed for dynamic target allocation. We have conducted a thorough analysis of strike positions and times, establishing robust mathematical models. Additionally, we designed several experiments to test the effectiveness of the algorithm. The proposed algorithm improves the quality of the solution by at least 30% in larger scale scenarios compared to the genetic algorithm (GA), and the average solution speed is less than 10% of the GA, demonstrating the feasibility of the algorithm in solving the problem.

引用

页数：20

共 49 条

[1] Oriented stochastic loss descent algorithm to train very deep multi-layer neural networks without vanishing gradients
Abuqaddom, Inas
Mahafzah, Basel A.
Faris, Hossam
[J]. KNOWLEDGE-BASED SYSTEMS, 2021, 230
[2] Efficient Task Assignment for Multiple Vehicles With Partially Unreachable Target Locations
Bai, Xiaoshan
Yan, Weisheng
Ge, Shuzhi Sam
[J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05): : 3730 - 3742
[3] Bello I., 2016, Neural Combinatorial Optimization with Reinforcement Learning, DOI 10.48550/arXiv.1611.09940
[4] Missile defense and interceptor allocation by neuro-dynamic programming
Bertsekas, DP
Homer, ML
Logan, DA
Patek, SD
Sandell, NR
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (01): : 42 - 51
[5] Hybrid genetic-simulated annealing algorithm for optimal weapon allocation in multilayer defence scenario
Bisht, S
[J]. DEFENCE SCIENCE JOURNAL, 2004, 54 (03) : 395 - 405
[6] Quick Collateral Damage Estimation Based on Weapons Assigned to Targets
Bogdanowicz, Zbigniew R.
Patel, Ketula
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, 45 (05): : 762 - 769
[7] Fire scheduling for planned artillery attack operations under time-dependent destruction probabilities
Cha, Young-Ho
Kim, Yeong-Dae
[J]. OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2010, 38 (05): : 383 - 392
[8] A robust optimization approach for an artillery fire-scheduling problem under uncertain threat
Choi, Yong Baek
Jin, Suk Ho
Kim, Kyung Sup
Chung, Byung Do
[J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 125 : 23 - 32
[9] Chung T.H., 2023, Field Robot, V3, P97, DOI [10.55417/fr.2023003, DOI 10.55417/FR.2023003]
[10] Dai HJ, 2020, Arxiv, DOI arXiv:1603.05629

← 1 2 3 4 5 →