Deep Relationship Graph Reinforcement Learning for Multi-Aircraft Air Combat

Cited by: 2

Authors:

Han, Yue [1]

Piao, Haiyin [2]

Hou, Yaqing [3]

Sun, Yang [1]

Sun, Zhixiao [4]

Zhou, Deyun [5]

Yang, Shengqi [1]

Peng, Xuanqi [1]

Fan, Songyuan [1]

Affiliations:

[1] SADRI Inst, Dept AI Ctr, Shenyang, Peoples R China

[2] Northwestern Polytech Univ, Sch Elect & Informat, Xian, Peoples R China

[3] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China

[4] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian, Peoples R China

[5] Northwestern Polytech Univ, Sch Microelect, Xian, Peoples R China

Source:

2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2022

Keywords:

air combat AI; multi-aircraft collaboration; reinforcement learning; graph neural network

DOI:

10.1109/IJCNN55064.2022.9892208

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Air combat artificial intelligence (AI) has attracted increasing attention from aeronautics engineers and AI researchers. However, existing methods often struggle to solve collaboration problems in multi-aircraft air combat because of the high complexity caused by combinatorial explosion. To address this, we propose a Deep Relationship Graph Reinforcement Learning (DRGRL) algorithm for multi-aircraft collaboration. Specifically, DRGRL significantly simplifies the complex situation space by abstracting the original problem into a symbolic form. In addition, a novel Air Combat Relationship Graph (ACRG) is introduced to represent the learned collaboration pattern; it concentrates on the combat relationships most important for tactical decision making. Experiments are conducted in an air combat simulation environment named WUKONG. The experimental results demonstrate that DRGRL learns valuable collaboration patterns and achieves better combat performance than state-of-the-art air combat AI methods.


Pages: 8
