Multi-Unmanned Aerial Vehicle Confrontation in Intelligent Air Combat: A Multi-Agent Deep Reinforcement Learning Approach

Cited by: 3

Authors:

Yang, Jianfeng [1]

Yang, Xinwei [2]

Yu, Tianqi [1]

Affiliations:

[1] Soochow Univ, Sch Elect & Informat Engn, Suzhou 215006, Peoples R China

[2] Guangdong Power Grid Corp, Dongguan Power Supply Bur, Dongguan, Peoples R China

Source:

DRONES | 2024 / Vol. 8 / No. 8

Funding:

National Natural Science Foundation of China;

Keywords:

multi-UAV confrontation; intelligent decision-making; multi-agent deep reinforcement learning; DECISION-MAKING;

DOI:

10.3390/drones8080382

Chinese Library Classification (CLC) number:

TP7 [Remote Sensing Technology];

Subject classification codes:

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

Abstract:

Multiple unmanned aerial vehicle (multi-UAV) confrontation is becoming an increasingly important combat mode in intelligent air combat. The confrontation highly relies on the intelligent collaboration and real-time decision-making of the UAVs. Thus, a decomposed and prioritized experience replay (PER)-based multi-agent deep deterministic policy gradient (DP-MADDPG) algorithm has been proposed in this paper for the moving and attacking decisions of UAVs. Specifically, the confrontation is formulated as a partially observable Markov game. To solve the problem, the DP-MADDPG algorithm is proposed by integrating the decomposed and PER mechanisms into the traditional MADDPG. To overcome the technical challenges of the convergence to a local optimum and a single dominant policy, the decomposed mechanism is applied to modify the MADDPG framework with local and global dual critic networks. Furthermore, to improve the convergence rate of the MADDPG training process, the PER mechanism is utilized to optimize the sampling efficiency from the experience replay buffer. Simulations have been conducted based on the Multi-agent Combat Arena (MaCA) platform, wherein the traditional MADDPG and independent learning DDPG (ILDDPG) algorithms are benchmarks. Simulation results indicate that the proposed DP-MADDPG improves the convergence rate and the convergent reward value. During confrontations against the vanilla distance-prioritized rule-empowered and intelligent ILDDPG-empowered blue parties, the DP-MADDPG-empowered red party can improve the win rate to 96% and 80.5%, respectively.
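The abstract states that PER is used to improve sampling efficiency from the experience replay buffer but gives no implementation details. As a rough illustration only, the following is a minimal sketch of generic proportional prioritized experience replay, not the paper's code. The class name `PrioritizedReplayBuffer`, the parameters `alpha`, `beta`, and `eps`, and their default values are all assumptions introduced here for the example.

```python
import random


class PrioritizedReplayBuffer:
    """Generic proportional PER (illustrative sketch; not the paper's implementation)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-6):
        self.capacity = capacity
        self.alpha = alpha    # assumed: how strongly priorities skew sampling (0 = uniform)
        self.eps = eps        # assumed: keeps every priority strictly positive
        self.items = []
        self.priorities = []
        self.next_idx = 0     # ring-buffer write position

    def add(self, transition):
        # New transitions receive the current maximum priority,
        # so they are likely to be replayed before their TD error is known.
        p = max(self.priorities, default=1.0)
        if len(self.items) < self.capacity:
            self.items.append(transition)
            self.priorities.append(p)
        else:
            self.items[self.next_idx] = transition
            self.priorities[self.next_idx] = p
        self.next_idx = (self.next_idx + 1) % self.capacity

    def sample(self, batch_size, beta=0.4, rng=random):
        # Sampling probability is proportional to priority ** alpha.
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        idxs = rng.choices(range(len(self.items)), weights=probs, k=batch_size)
        # Importance-sampling weights offset the bias of non-uniform sampling;
        # they are normalized by the largest weight in the batch.
        n = len(self.items)
        weights = [(n * probs[i]) ** (-beta) for i in idxs]
        max_w = max(weights)
        weights = [w / max_w for w in weights]
        batch = [self.items[i] for i in idxs]
        return batch, idxs, weights

    def update_priorities(self, idxs, td_errors):
        # After a learning step, priority tracks the magnitude of the TD error.
        for i, err in zip(idxs, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

In a MADDPG-style training loop, a critic update would typically call `sample`, scale each sample's loss by its returned weight, and then pass the new TD errors to `update_priorities`. How DP-MADDPG combines this with its local and global critics is not described in the abstract and is not modeled here.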


Pages: 18
