Multi-Unmanned Aerial Vehicle Confrontation in Intelligent Air Combat: A Multi-Agent Deep Reinforcement Learning Approach

被引:3
|
作者
Yang, Jianfeng [1 ]
Yang, Xinwei [2 ]
Yu, Tianqi [1 ]
机构
[1] Soochow Univ, Sch Elect & Informat Engn, Suzhou 215006, Peoples R China
[2] Guangdong Power Grid Corp, Dongguan Power Supply Bur, Dongguan, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-UAV confrontation; intelligent decision-making; multi-agent deep reinforcement learning; DECISION-MAKING;
D O I
10.3390/drones8080382
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Multiple unmanned aerial vehicle (multi-UAV) confrontation is becoming an increasingly important combat mode in intelligent air combat. The confrontation highly relies on the intelligent collaboration and real-time decision-making of the UAVs. Thus, a decomposed and prioritized experience replay (PER)-based multi-agent deep deterministic policy gradient (DP-MADDPG) algorithm has been proposed in this paper for the moving and attacking decisions of UAVs. Specifically, the confrontation is formulated as a partially observable Markov game. To solve the problem, the DP-MADDPG algorithm is proposed by integrating the decomposed and PER mechanisms into the traditional MADDPG. To overcome the technical challenges of the convergence to a local optimum and a single dominant policy, the decomposed mechanism is applied to modify the MADDPG framework with local and global dual critic networks. Furthermore, to improve the convergence rate of the MADDPG training process, the PER mechanism is utilized to optimize the sampling efficiency from the experience replay buffer. Simulations have been conducted based on the Multi-agent Combat Arena (MaCA) platform, wherein the traditional MADDPG and independent learning DDPG (ILDDPG) algorithms are benchmarks. Simulation results indicate that the proposed DP-MADDPG improves the convergence rate and the convergent reward value. During confrontations against the vanilla distance-prioritized rule-empowered and intelligent ILDDPG-empowered blue parties, the DP-MADDPG-empowered red party can improve the win rate to 96% and 80.5%, respectively.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Deep Reinforcement Learning With Application to Air Confrontation Intelligent Decision-Making of Manned/Unmanned Aerial Vehicle Cooperative System
    Li, Yue
    Han, Wei
    Wang, Yongqing
    IEEE ACCESS, 2020, 8 (08): : 67887 - 67898
  • [2] A multi-agent deep reinforcement learning approach for solving the multi-depot vehicle routing problem
    Arishi, Ali
    Krishnan, Krishna
    JOURNAL OF MANAGEMENT ANALYTICS, 2023, 10 (03) : 493 - 515
  • [3] Multi-Agent Deep Reinforcement Learning Framework Strategized by Unmanned Aerial Vehicles for Multi-Vessel Full Communication Connection
    Cao, Jiabao
    Dou, Jinfeng
    Liu, Jilong
    Wei, Xuanning
    Guo, Zhongwen
    REMOTE SENSING, 2023, 15 (16)
  • [4] Cooperative Multi-UAV Positioning for Aerial Internet Service Management: A Multi-Agent Deep Reinforcement Learning Approach
    Kim, Joongheon
    Park, Soohyun
    Jung, Soyi
    Cordeiro, Carlos
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (04): : 3797 - 3812
  • [5] Lenient Multi-Agent Deep Reinforcement Learning
    Palmer, Gregory
    Tuyls, Karl
    Bloembergen, Daan
    Savani, Rahul
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 443 - 451
  • [6] Intelligent games meeting with multi-agent deep reinforcement learning: a comprehensive review
    Wang, Yiqin
    Wang, Yufeng
    Tian, Feng
    Ma, Jianhua
    Jin, Qun
    ARTIFICIAL INTELLIGENCE REVIEW, 2025, 58 (06)
  • [7] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
    Zhang Jiandong
    Yang Qiming
    Shi Guoqing
    Lu Yi
    Wu Yong
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2021, 32 (06) : 1421 - 1438
  • [8] Intelligent EV Charging for Urban Prosumer Communities: An Auction and Multi-Agent Deep Reinforcement Learning Approach
    Zou, Luyao
    Munir, Md. Shirajum
    Tun, Yan Kyaw
    Kang, Seokwon
    Hong, Choong Seon
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (04): : 4384 - 4407
  • [9] Strategic Interaction Multi-Agent Deep Reinforcement Learning
    Zhou, Wenhong
    Li, Jie
    Chen, Yiting
    Shen, Lin-Cheng
    IEEE ACCESS, 2020, 8 : 119000 - 119009
  • [10] Competitive Evolution Multi-Agent Deep Reinforcement Learning
    Zhou, Wenhong
    Chen, Yiting
    Li, Jie
    PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2019), 2019,