Reinforcement Learning for Multiaircraft Autonomous Air Combat in Multisensor UCAV Platform

Cited by: 3

Authors:

Kong, Weiren [1]

Zhou, Deyun [1]

Du, Yongjie [1]

Zhou, Ying [1]

Zhao, Yiyang [1]

Affiliations:

[1] Northwestern Polytechnical University, School of Electronics and Information, Xi'an 710129, People's Republic of China

Source:

IEEE Sensors Journal | 2023 / Vol. 23 / No. 18

Funding:

National Natural Science Foundation of China

Keywords:

Artificial intelligence (AI); autonomous air combat; competitive self-play (SP); maneuver decision-making; multiagent reinforcement learning (MARL); GAME; DECISION; GO;

DOI:

10.1109/JSEN.2022.3220324

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Discipline classification code:

0808; 0809

Abstract:

Autonomous air combat has received significant attention from researchers working on artificial intelligence (AI) applications. Most previous research on autonomous air combat has focused on one-on-one air combat scenarios in which air combat situational information is considered to be precisely observable. However, most modern air combats are conducted in formations, where air combat situational information is obtained from multiple sensors. Therefore, we introduce a novel automated maneuver decision architecture for close-range multiaircraft air combat scenarios under the multisensor unmanned combat aerial vehicle (UCAV) platform that can handle air combat scenarios with variable-sized formations. Then, a multiagent reinforcement learning (MARL) algorithm is proposed to obtain the strategy. The training performance of the algorithm is evaluated, and the obtained strategy is analyzed in different air combat scenarios; the formations are found to exhibit effective cooperative behavior in both symmetric and asymmetric situations. Finally, we give ideas for the engineering implementation of a maneuver control architecture. This study provides a solution for future multiaircraft autonomous air combat.

Pages: 20596-20606

Page count: 11

