An Evolutionary Reinforcement Learning Approach for Autonomous Maneuver Decision in One-to-One Short-Range Air Combat

被引:1
作者
Baykal, Yasin [1 ]
Baspinar, Baris [2 ]
机构
[1] Istanbul Tech Univ, Def Technol, Istanbul, Turkiye
[2] Istanbul Tech Univ, Dept Aeronaut Engn, Istanbul, Turkiye
来源
2023 IEEE/AIAA 42ND DIGITAL AVIONICS SYSTEMS CONFERENCE, DASC | 2023年
关键词
air combat; reinforcement learning; decision-making;
D O I
10.1109/DASC58513.2023.10311295
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This paper presents an evolutionary reinforcement learning approach based on Deep Q Networks to address the maneuver decision challenge of unmanned aerial vehicles (UAV) in short-range aerial combat. The proposed approach aims to improve the UAVs' autonomous maneuver decision process and generate a robust policy against alternative enemy strategies. The training process involves parallel training of multiple workers, evaluation of models at regular intervals, selection of the best model, testing the best model against enemy policies, and updating the pool of enemy strategies. The proposed method continuously improves the trained models and generates more robust policies with higher win rates than standard reinforcement learning techniques or k-level learning approaches.
引用
收藏
页数:9
相关论文
共 36 条
  • [1] GAME-THEORY FOR AUTOMATED MANEUVERING DURING AIR-TO-AIR COMBAT
    AUSTIN, F
    CARBONE, G
    FALCO, M
    HINZ, H
    LEWIS, M
    [J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1990, 13 (06) : 1143 - 1149
  • [2] Deep Reinforcement Learning-Based Air-to-Air Combat Maneuver Generation in a Realistic Environment
    Bae, Jung Ho
    Jung, Hoseong
    Kim, Seogbong
    Kim, Sungho
    Kim, Yong-Duk
    [J]. IEEE ACCESS, 2023, 11 : 26427 - 26440
  • [3] Baspinar B., 2018, AIAA MOD SIM TECHN C
  • [4] Baspinar B., 2019, AIAA SCITECH FOR
  • [5] Assessment of Aerial Combat Game via Optimization-Based Receding Horizon Control
    Baspinar, Baris
    Koyuncu, Emre
    [J]. IEEE ACCESS, 2020, 8 : 35853 - 35863
  • [6] Burgin G.H., 1975, Contractor Report, VI
  • [7] Burgin GH, 1975, ADAPTIVE MANEUVERING, V2
  • [8] Autonomous Maneuver Decision of UCAV Air Combat Based on Double Deep Q Network Algorithm and Stochastic Game Theory
    Cao, Yuan
    Kou, Ying-Xin
    Li, Zhan-Wu
    Xu, An
    [J]. INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2023, 2023
  • [9] A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat
    Chai, Jiajun
    Chen, Wenzhang
    Zhu, Yuanheng
    Yao, Zong-Xin
    Zhao, Dongbin
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (09): : 5417 - 5429
  • [10] Chen YY, 2020, I C CONT AUTOMAT ROB, P817, DOI [10.1109/ICARCV50220.2020.9305467, 10.1109/icarcv50220.2020.9305467]