Learning System for Air Combat Decision Inspired by Cognitive Mechanisms of the Brain

被引：24

作者：

Zhou, Kai ^{[1
]}

Wei, Ruixuan ^{[1
]}

Zhang, Qirui ^{[1
]}

Xu, Zhuofan ^{[2
]}

机构：

[1] Air Force Engn Univ, Aeronaut Engn Coll, Xian 710038, Peoples R China

[2] Natl Def Univ Peoples Liberat Army, Joint Operat Coll, Shijiazhuang 050084, Hebei, Peoples R China

来源：

IEEE ACCESS | 2020年 / 8卷

基金：

中国国家自然科学基金;

关键词：

Autonomous air combat; bio-inspired; cognitive mechanism; long short-term memory; learning system; unmanned aerial vehicles; SEQUENTIAL MANEUVERING DECISIONS; THEORETICAL APPROACH; INFLUENCE DIAGRAM; GAME; PREDICTION; NETWORKS;

D O I：

10.1109/ACCESS.2020.2964031

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Unmanned aerial vehicles (UAVs) have played an important role in recent high-tech local wars. Seizing air control rights with UAVs will undoubtedly be a popular topic in future military development. Autonomous air combat is complex, antagonistic and mutable, and consequently, the decision-making that depends on unmanned systems is extremely challenging with very little research having been conducted on it. An intelligent air combat learning system inspired by the learning mechanisms of the brain is proposed in this paper. In accordance with research on learning, knowledge and memory, we constructed a cognitive mechanism model of the brain. Based on this model and the inferential abilities of humans, a long short-term hierarchical multi-line learning system is established. Then, the bio-inspired architecture and the basic learning principle of the system are clarified. Taking advantage of the conclusions of studies on information theory, the relationship between the knowledge updating cycle and the system learning performance is analysed. The updating cycle length adjustment problem is transformed into an optimization problem optimization problem, and system performance improvement is guaranteed. Experiments show that the system designed in this paper can acquire confrontation abilities through self-learning without prior rules; the parallel universe mechanism can significantly improve the system & x2019;s learning speed when the number of parallels is within 40, and the performance of the system improves gradually and continuously. The system can master actions similar to classical tactical manoeuvres such as the high yo-yo and the barrel-roll-attack without prior knowledge. Compared with the Bayesian inference and moving horizon optimization (BI & x0026;MHO) method, the learning system proposed in this paper is more flexible in situation assessment and in the prediction of opponents & x2019; actions. Although it cannot be deployed quickly, it has a continuous learning ability.

引用

页码：8129 / 8144

页数：16

共 64 条

[1]

[Anonymous], 2015, THESIS

[2]

[Anonymous], 2018, CANCER DISCOV, DOI DOI 10.1158/2159-8290.CD-17-1260

[3]

[Anonymous], 2005, Information Theory, Inference and Learning Algorithms

[4]

[Anonymous], 2016, ARXIV151106581

[5]

[Anonymous], 2010, P ICML 2010 P 27 INT

[6] Vector-based navigation using grid-like representations in artificial agents [J].

Banino, Andrea ;

Barry, Caswell ;

Uria, Benigno ;

Blundell, Charles ;

Lillicrap, Timothy ;

Mirowski, Piotr ;

Pritzel, Alexander ;

Chadwick, Martin J. ;

Degris, Thomas ;

Modayil, Joseph ;

Wayne, Greg ;

Soyer, Hubert ;

Viola, Fabio ;

Zhang, Brian ;

Goroshin, Ross ;

Rabinowitz, Neil ;

Pascanu, Razvan ;

Beattie, Charlie ;

Petersen, Stig ;

Sadik, Amir ;

Gaffney, Stephen ;

King, Helen ;

Kavukcuoglu, Koray ;

Hassabis, Demis ;

Hadsell, Raia ;

Kumaran, Dharshan .

NATURE, 2018, 557 (7705) :429-+

[7]

BRUCE A C, 2009, J GUIDANCE CONTROL D, V32, P474, DOI DOI 10.2514/1.37962

[8]

Busoniu L., 2011, Proceedings of the 2011 IEEE SSCI Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2011), P1, DOI 10.1109/ADPRL.2011.5967353

[9] The decision method research on air combat game based onuncertain interval information [J].

Chen, Xia ;

Zhao, Mingming .

2012 FIFTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2012), VOL 1, 2012, :456-459

[10]

Chithapuram C, 2014, 2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), P1256, DOI 10.1109/IC3I.2014.7019634

← 1 2 3 4 5 6 7 →