Dynamic Target Assignment by Unmanned Surface Vehicles Based on Reinforcement Learning

被引:1
作者
Hu, Tao [1 ]
Zhang, Xiaoxue [1 ]
Luo, Xueshan [1 ]
Chen, Tao [1 ]
机构
[1] Natl Univ Def Technol, Natl Key Lab Informat Syst Engn, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
moving targets; weapon-target assignment; unmanned surface vessels; reinforcement learning; multi agent; LARGE NEIGHBORHOOD SEARCH; MISSILE DEFENSE; TASK ASSIGNMENT; OPTIMIZATION; ALLOCATION; ALGORITHM; HYBRID;
D O I
10.3390/math12162557
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Due to the dynamic complexities of the multi-unmanned vessel target assignment problem at sea, especially when addressing moving targets, traditional optimization algorithms often fail to quickly find an adequate solution. To overcome this, we have developed a multi-agent reinforcement learning algorithm. This approach involves defining a state space, employing preferential experience replay, and integrating self-attention mechanisms, which are applied to a novel offshore unmanned vessel model designed for dynamic target allocation. We have conducted a thorough analysis of strike positions and times, establishing robust mathematical models. Additionally, we designed several experiments to test the effectiveness of the algorithm. The proposed algorithm improves the quality of the solution by at least 30% in larger scale scenarios compared to the genetic algorithm (GA), and the average solution speed is less than 10% of the GA, demonstrating the feasibility of the algorithm in solving the problem.
引用
收藏
页数:20
相关论文
共 49 条
  • [1] Oriented stochastic loss descent algorithm to train very deep multi-layer neural networks without vanishing gradients
    Abuqaddom, Inas
    Mahafzah, Basel A.
    Faris, Hossam
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 230
  • [2] Efficient Task Assignment for Multiple Vehicles With Partially Unreachable Target Locations
    Bai, Xiaoshan
    Yan, Weisheng
    Ge, Shuzhi Sam
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05): : 3730 - 3742
  • [3] Bello I., 2016, Neural Combinatorial Optimization with Reinforcement Learning, DOI 10.48550/arXiv.1611.09940
  • [4] Missile defense and interceptor allocation by neuro-dynamic programming
    Bertsekas, DP
    Homer, ML
    Logan, DA
    Patek, SD
    Sandell, NR
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (01): : 42 - 51
  • [5] Hybrid genetic-simulated annealing algorithm for optimal weapon allocation in multilayer defence scenario
    Bisht, S
    [J]. DEFENCE SCIENCE JOURNAL, 2004, 54 (03) : 395 - 405
  • [6] Quick Collateral Damage Estimation Based on Weapons Assigned to Targets
    Bogdanowicz, Zbigniew R.
    Patel, Ketula
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, 45 (05): : 762 - 769
  • [7] Fire scheduling for planned artillery attack operations under time-dependent destruction probabilities
    Cha, Young-Ho
    Kim, Yeong-Dae
    [J]. OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2010, 38 (05): : 383 - 392
  • [8] A robust optimization approach for an artillery fire-scheduling problem under uncertain threat
    Choi, Yong Baek
    Jin, Suk Ho
    Kim, Kyung Sup
    Chung, Byung Do
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 125 : 23 - 32
  • [9] Chung T.H., 2023, Field Robot, V3, P97, DOI [10.55417/fr.2023003, DOI 10.55417/FR.2023003]
  • [10] Dai HJ, 2020, Arxiv, DOI arXiv:1603.05629