The Optimal Strategies of Maneuver Decision in Air Combat of UCAV Based on the Improved TD3 Algorithm

被引:2
|
作者
Gao, Xianzhong [1 ]
Zhang, Yue [2 ]
Wang, Baolai [3 ]
Leng, Zhihui [4 ]
Hou, Zhongxi [1 ,2 ]
机构
[1] Natl Univ Def Technol, Test Ctr, Xian 710106, Peoples R China
[2] Natl Univ Def Technol, Coll Aerosp Sci & Engn, Changsha 410073, Peoples R China
[3] Natl Univ Def Technol, Coll Comp, Changsha 410073, Peoples R China
[4] Jiangxi Hongdu Aviat Ind Grp Co Ltd, Nanchang 330096, Peoples R China
关键词
unmanned combat aerial vehicles (UCAVs); maneuver decision-making; autonomous air combat; deep reinforcement learning; scenario-transfer training;
D O I
10.3390/drones8090501
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Nowadays, unmanned aerial vehicles (UAVs) pose a significant challenge to air defense systems. Unmanned combat aerial vehicles (UCAVs) have been proven to be an effective method to counter the threat of UAVs in application. Therefore, maneuver decision-making has become the crucial technology to achieve autonomous air combat for UCAVs. In order to solve the problem of maneuver decision-making, an autonomous model of UCAVs based on the deep reinforcement learning method was proposed in this paper. Firstly, the six-degree-of-freedom (DoF) dynamic model was built in three-dimensional space, and the continuous actions of tangential overload, normal overload, and roll angle were selected as the maneuver inputs. Secondly, to improve the convergence speed for the deep reinforcement learning method, the idea of "scenario-transfer training" was introduced into the twin delayed deep deterministic (TD3) policy gradient algorithm, the results showing that the improved algorithm could cut off about 60% of the training time. Thirdly, for the "nose-to-nose turns", which is one of the classical maneuvers for experienced pilots, the optimal maneuver generated by the proposed method was analyzed. The results showed that the maneuver strategy obtained by the proposed method was highly consistent with that made by experienced fighter pilots. This is also the first time in a public article that compared the maneuver decisions made by the deep reinforcement learning method with experienced fighter pilots. This research can provide some meaningful references to generate autonomous decision-making strategies for UCAVs.
引用
收藏
页数:26
相关论文
共 50 条
  • [31] Maneuver Decision of Autonomous Air Combat of Unmanned Combat Aerial Vehicle Based on Deep Neural Network
    Zhang H.
    Huang C.
    Xuan Y.
    Tang S.
    Binggong Xuebao/Acta Armamentarii, 2020, 41 (08): : 1613 - 1622
  • [32] Autonomous localized path planning algorithm for UAVs based on TD3 strategy
    Zhao, Feiyu
    Li, Dayan
    Wang, Zhengxu
    Mao, Jianlin
    Wang, Niya
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [33] Mobile robot navigation based on intrinsic reward mechanism with TD3 algorithm
    Yang, Jianan
    Liu, Yu
    Zhang, Jie
    Guan, Yong
    Shao, Zhenzhou
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2024, 21 (05):
  • [34] Reinforcement Learning Control of Hydraulic Servo System Based on TD3 Algorithm
    Yuan, Xiaoming
    Wang, Yu
    Zhang, Ruicong
    Gao, Qiang
    Zhou, Zhuangding
    Zhou, Rulin
    Yin, Fengyuan
    MACHINES, 2022, 10 (12)
  • [35] Research and Application of an Improved TD3 Algorithm in Mobile Robot Environment Perception and Autonomous Navigation
    Fu, Bo
    Yao, Xulin
    2024 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, ARTIFICIAL INTELLIGENCE AND INTELLIGENT CONTROL, RAIIC 2024, 2024, : 158 - 162
  • [36] Autonomous localized path planning algorithm for UAVs based on TD3 strategy
    Zhao Feiyu
    Li Dayan
    Wang Zhengxu
    Mao Jianlin
    Wang Niya
    Scientific Reports, 14
  • [37] Intelligent maneuvering decision-making in two-UCAV cooperative air combat based on improved MADDPG with hybrid hyper network
    Li, Wentao
    Fang, Feng
    Wang, Zhenya
    Zhu, Yichao
    Peng, Dongliang
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (17):
  • [38] A Multi-UCAV Cooperative Decision-Making Method Based on an MAPPO Algorithm for Beyond-Visual-Range Air Combat
    Liu, Xiaoxiong
    Yin, Yi
    Su, Yuzhan
    Ming, Ruichen
    AEROSPACE, 2022, 9 (10)
  • [39] Hierarchical Online Air Combat Maneuver Decision Making and Control Based on Surrogate-Assisted Differential Evolution Algorithm
    Tan, Mulai
    Sun, Haocheng
    Ding, Dali
    Zhou, Huan
    Han, Tong
    Luo, Yuequn
    DRONES, 2025, 9 (02)
  • [40] Decision-making method for air combat maneuver based on explainable reinforcement learning
    Yang, Shuheng
    Zhang, Dong
    Xiong, Wei
    Ren, Zhi
    Tang, Shuo
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (18):