Enhancing BVR Air Combat Agent Development With Attention-Driven Reinforcement Learning

Cited by: 0

Authors:

Kuroswiski, Andre R. [1]

Wu, Annie S. [2]

Passaro, Angelo [3]

Affiliations:

[1] Inst Tecnol Aeronaut, BR-12228900 Sao Jose Dos Campos, SP, Brazil

[2] Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA

[3] Inst Estudos Avancados, BR-12228001 Sao Jose Dos Campos, SP, Brazil

Source:

IEEE ACCESS | 2025 / Vol. 13

Keywords:

Decision making; Autonomous agents; Visualization; Training; Reinforcement learning; Atmospheric modeling; Missiles; Uncertainty; Robustness; Aircraft; Adversarial learning; artificial intelligence; autonomous agents; beyond visual range air combat; multi-head attention; reinforcement learning; UCAV;

DOI:

10.1109/ACCESS.2025.3561250

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

This study explores the use of Reinforcement Learning (RL) to develop autonomous agents for Beyond Visual Range (BVR) air combat, addressing the challenges of dynamic and uncertain adversarial scenarios. We propose a novel approach that introduces a task-based layer, leveraging domain expertise to optimize decision-making and training efficiency. By integrating multi-head attention mechanisms into the policy model and employing an improved DQN algorithm, agents dynamically select context-aware tasks, enabling the learning of efficient emergent behaviors for variable engagement conditions. Evaluations in single- and multi-agent BVR scenarios against adversaries with diverse tactical characteristics demonstrate superior training efficiency and enhanced agent capabilities compared to leading RL algorithms commonly applied in similar domains, including PPO, DDPG, and SAC. A robustness study underscores the critical role of diverse enemy selection in the RL process, showing that adversaries with variable tactical behaviors are essential for developing robust agents. This work advances RL methodologies for autonomous BVR air combat and provides insights applicable to other problems with challenging adversarial scenarios.


Pages: 70446-70463

Page count: 18

