Multi-UAV Cooperative Target Assignment Method Based on Reinforcement Learning

被引:0
作者
Ding, Yunlong [1 ]
Kuang, Minchi [1 ,2 ]
Shi, Heng [2 ]
Gao, Jiazhan [1 ]
机构
[1] Xinjiang Univ, Sch Comp Sci & Technol, Urumqi 830046, Peoples R China
[2] Tsinghua Univ, Precis Instruments Dept, Beijing 100084, Peoples R China
关键词
target assignment; multi-UAV air combat; reinforcement learning; attention mechanism; PPO;
D O I
10.3390/drones8100562
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
To overcome the problems of traditional distributed target allocation algorithms in terms of lack of target strategic priority, poor scalability, and robustness, this paper proposes a proximal strategy optimization algorithm that combines threat assessment and attention mechanism (TAPPO). Based on the distributed training framework, the algorithm integrates a threat assessment and dynamic attention strategy and designs a dynamic reward function based on the current hit rate of the drone and the missile benefit ratio to improve the algorithm's exploration ability and scalability. Through an 8vs8 multi-UAV confrontation experiment in a digital twin simulation environment, the results show that the agent using the TAPPO algorithm for target allocation defeats the state machine with an 85% winning rate and is significantly better than other current mainstream target allocation algorithms, verifying the effectiveness of the algorithm.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Reinforcement Learning based Approach for Multi-UAV Cooperative Searching in Unknown Environments
    Yue, Wei
    Guan, Xianhe
    Xi, Yun
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2018 - 2023
  • [2] Multi-UAV simultaneous target assignment and path planning based on deep reinforcement learning in dynamic multiple obstacles environments
    Kong, Xiaoran
    Zhou, Yatong
    Li, Zhe
    Wang, Shaohai
    FRONTIERS IN NEUROROBOTICS, 2024, 17
  • [3] Extrinsic-and-Intrinsic Reward-Based Multi-Agent Reinforcement Learning for Multi-UAV Cooperative Target Encirclement
    Chen, Jinchao
    Wang, Yang
    Zhang, Ying
    Lu, Yantao
    Shu, Qiuhao
    Hu, Yujiao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025,
  • [4] Multi-UAV Collaborative Detection Based on Reinforcement Learning
    Hao, Yuanhui
    Guo, Chubing
    Ke, Liangjun
    ADVANCES IN SWARM INTELLIGENCE, PT I, ICSI 2024, 2024, 14788 : 463 - 474
  • [5] Deep Reinforcement Learning Multi-UAV Trajectory Control for Target Tracking
    Moon, Jiseon
    Papaioannou, Savvas
    Laoudias, Christos
    Kolios, Panayiotis
    Kim, Sunwoo
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (20) : 15441 - 15455
  • [6] Maintaining Connectivity for Multi-UAV Multi-Target Search Using Reinforcement Learning
    Guven, Islam
    Yanmaz, Evsen
    PROCEEDINGS OF THE INT'L ACM SYMPOSIUM ON DESIGN AND ANALYSIS OF INTELLIGENT VEHICULAR NETWORKS AND APPLICATIONS, DIVANET 2023, 2023, : 109 - 114
  • [7] Collaborative Decision-Making Method for Multi-UAV Based on Multiagent Reinforcement Learning
    Li, Shaowei
    Jia, Yuhong
    Yang, Fan
    Qin, Qingyang
    Gao, Hui
    Zhou, Yaoming
    IEEE ACCESS, 2022, 10 : 91385 - 91396
  • [8] Multi-UAV Air Combat Weapon-Target Assignment Based On Genetic Algorithm And Deep Learning
    Li, Gaolei
    Wang, Yuxing
    Lu, Chuan
    Zhang, Zhen
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 3418 - 3423
  • [9] Optimization Design of Multi-UAV Communication Network Based on Reinforcement Learning
    Cao, Zhengyang
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [10] Reinforcement Learning Based Trajectory Planning for Multi-UAV Load Transportation
    Estevez, Julian
    Manuel Lopez-Guede, Jose
    del Valle-Echavarri, Javier
    Grana, Manuel
    IEEE ACCESS, 2024, 12 : 144009 - 144016