Reinforcement Learning for Multiaircraft Autonomous Air Combat in Multisensor UCAV Platform

被引:3
作者
Kong, Weiren [1 ]
Zhou, Deyun [1 ]
Du, Yongjie [1 ]
Zhou, Ying [1 ]
Zhao, Yiyang [1 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710129, Peoples R China
基金
中国国家自然科学基金;
关键词
Artificial intelligence (AI); autonomous air combat; competitive self-play (SP); maneuver decision-making; multiagent reinforcement learning (MARL); GAME; DECISION; GO;
D O I
10.1109/JSEN.2022.3220324
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Autonomous air combat has received significant attention from researchers working on artificial intelligence (AI) applications. Most previous research on autonomous air combat has focused on one-on-one air combat scenarios in which air combat situational information is considered to be precisely observable. However, most modern air combats are conducted in formations, where air combat situational information is obtained from multiple sensors. Therefore, we introduce a novel automated maneuver decision architecture for close-range multiaircraft air combat scenarios under the multisensor unmanned combat aerial vehicle (UCAV) platform that can handle air combat scenarios with variable-sized formations. Then, a multiagent reinforcement learning (MARL) algorithm is proposed to obtain the strategy. The training performance of the training algorithm is evaluated, the obtained strategy is analyzed in different air combat scenarios, and it is found that these formations exhibit effective cooperative behavior in symmetric and asymmetric situations. Finally, we give ideas for the engineering implementation of a maneuver control architecture. This study provides a solution for future multiaircraft autonomous air combat.
引用
收藏
页码:20596 / 20606
页数:11
相关论文
共 49 条
  • [1] Arrow KJ., 1958, Studies in linear and nonlinear programming
  • [2] GAME-THEORY FOR AUTOMATED MANEUVERING DURING AIR-TO-AIR COMBAT
    AUSTIN, F
    CARBONE, G
    FALCO, M
    HINZ, H
    LEWIS, M
    [J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1990, 13 (06) : 1143 - 1149
  • [3] Austin F., 1987, GUIDANCE NAVIGATION, P2393
  • [4] Balduzzi D, 2019, PR MACH LEARN RES, V97
  • [5] Bansal T, 2018, Arxiv, DOI arXiv:1710.03748
  • [6] Bechtel R.J., 1992, Air Combat Maneuvering Expert System Trainer
  • [7] Brown GW., 1951, ACTIVITY ANAL PRODUC, V13, P374
  • [8] Busoniu L, 2010, STUD COMPUT INTELL, V310, P183
  • [9] A New Approach to Weapon-Target Assignment in Cooperative Air Combat
    Chang, Yi-zhe
    Li, Zhan-wu
    Kou, Ying-xin
    Sun, Qing-peng
    Yang, Hai-yan
    Zhao, Zheng-yan
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2017, 2017
  • [10] Czarnecki WM., 2020, ADV NEURAL INFORM PR, V33, P17443