An Evolutionary Reinforcement Learning Approach for Autonomous Maneuver Decision in One-to-One Short-Range Air Combat

被引：1

作者：

Baykal, Yasin ^{[1
]}

Baspinar, Baris ^{[2
]}

机构：

[1] Istanbul Tech Univ, Def Technol, Istanbul, Turkiye

[2] Istanbul Tech Univ, Dept Aeronaut Engn, Istanbul, Turkiye

来源：

2023 IEEE/AIAA 42ND DIGITAL AVIONICS SYSTEMS CONFERENCE, DASC | 2023年

关键词：

air combat; reinforcement learning; decision-making;

D O I：

10.1109/DASC58513.2023.10311295

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

This paper presents an evolutionary reinforcement learning approach based on Deep Q Networks to address the maneuver decision challenge of unmanned aerial vehicles (UAV) in short-range aerial combat. The proposed approach aims to improve the UAVs' autonomous maneuver decision process and generate a robust policy against alternative enemy strategies. The training process involves parallel training of multiple workers, evaluation of models at regular intervals, selection of the best model, testing the best model against enemy policies, and updating the pool of enemy strategies. The proposed method continuously improves the trained models and generates more robust policies with higher win rates than standard reinforcement learning techniques or k-level learning approaches.

引用

页数：9

共 36 条

[1] GAME-THEORY FOR AUTOMATED MANEUVERING DURING AIR-TO-AIR COMBAT
AUSTIN, F
CARBONE, G
FALCO, M
HINZ, H
LEWIS, M
[J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1990, 13 (06) : 1143 - 1149
[2] Deep Reinforcement Learning-Based Air-to-Air Combat Maneuver Generation in a Realistic Environment
Bae, Jung Ho
Jung, Hoseong
Kim, Seogbong
Kim, Sungho
Kim, Yong-Duk
[J]. IEEE ACCESS, 2023, 11 : 26427 - 26440
[3] Baspinar B., 2018, AIAA MOD SIM TECHN C
[4] Baspinar B., 2019, AIAA SCITECH FOR
[5] Assessment of Aerial Combat Game via Optimization-Based Receding Horizon Control
Baspinar, Baris
Koyuncu, Emre
[J]. IEEE ACCESS, 2020, 8 : 35853 - 35863
[6] Burgin G.H., 1975, Contractor Report, VI
[7] Burgin GH, 1975, ADAPTIVE MANEUVERING, V2
[8] Autonomous Maneuver Decision of UCAV Air Combat Based on Double Deep Q Network Algorithm and Stochastic Game Theory
Cao, Yuan
Kou, Ying-Xin
Li, Zhan-Wu
Xu, An
[J]. INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2023, 2023
[9] A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat
Chai, Jiajun
Chen, Wenzhang
Zhu, Yuanheng
Yao, Zong-Xin
Zhao, Dongbin
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (09): : 5417 - 5429
[10] Chen YY, 2020, I C CONT AUTOMAT ROB, P817, DOI [10.1109/ICARCV50220.2020.9305467, 10.1109/icarcv50220.2020.9305467]

← 1 2 3 4 →