The Optimal Strategies of Maneuver Decision in Air Combat of UCAV Based on the Improved TD3 Algorithm

被引:2
|
作者
Gao, Xianzhong [1 ]
Zhang, Yue [2 ]
Wang, Baolai [3 ]
Leng, Zhihui [4 ]
Hou, Zhongxi [1 ,2 ]
机构
[1] Natl Univ Def Technol, Test Ctr, Xian 710106, Peoples R China
[2] Natl Univ Def Technol, Coll Aerosp Sci & Engn, Changsha 410073, Peoples R China
[3] Natl Univ Def Technol, Coll Comp, Changsha 410073, Peoples R China
[4] Jiangxi Hongdu Aviat Ind Grp Co Ltd, Nanchang 330096, Peoples R China
关键词
unmanned combat aerial vehicles (UCAVs); maneuver decision-making; autonomous air combat; deep reinforcement learning; scenario-transfer training;
D O I
10.3390/drones8090501
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Nowadays, unmanned aerial vehicles (UAVs) pose a significant challenge to air defense systems. Unmanned combat aerial vehicles (UCAVs) have been proven to be an effective method to counter the threat of UAVs in application. Therefore, maneuver decision-making has become the crucial technology to achieve autonomous air combat for UCAVs. In order to solve the problem of maneuver decision-making, an autonomous model of UCAVs based on the deep reinforcement learning method was proposed in this paper. Firstly, the six-degree-of-freedom (DoF) dynamic model was built in three-dimensional space, and the continuous actions of tangential overload, normal overload, and roll angle were selected as the maneuver inputs. Secondly, to improve the convergence speed for the deep reinforcement learning method, the idea of "scenario-transfer training" was introduced into the twin delayed deep deterministic (TD3) policy gradient algorithm, the results showing that the improved algorithm could cut off about 60% of the training time. Thirdly, for the "nose-to-nose turns", which is one of the classical maneuvers for experienced pilots, the optimal maneuver generated by the proposed method was analyzed. The results showed that the maneuver strategy obtained by the proposed method was highly consistent with that made by experienced fighter pilots. This is also the first time in a public article that compared the maneuver decisions made by the deep reinforcement learning method with experienced fighter pilots. This research can provide some meaningful references to generate autonomous decision-making strategies for UCAVs.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] One-to-one Air-combat Maneuver Strategy Based on Improved TD3 Algorithm
    Qiu, Xuyi
    Yao, Ziyu
    Tan, Fuwei
    Zhu, Zhen
    Lu, Jun-Guo
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5719 - 5725
  • [2] Intelligent Air Combat Maneuvering Decision Based on TD3 Algorithm
    Zhou Xiaoyu
    Huang Jiangtao
    Zhu Zhe
    Zhang Sheng
    Zhou Pan
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1082 - 1094
  • [3] Maneuver decision of UCAV in air combat based on deep reinforcement learning
    Li, Yongfeng
    Shi, Jingping
    Zhang, Weiguo
    Jiang, Wei
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2021, 53 (12): : 33 - 41
  • [4] An Air Combat UCAV Autonomous Maneuver Decision Method Based on LSTM Network and MCDTS
    Dang, Fangyuan
    Zhu, Huaguang
    Ning, Xin
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 857 - 866
  • [5] Autonomous Maneuver Decision of UCAV Air Combat Based on Double Deep Q Network Algorithm and Stochastic Game Theory
    Cao, Yuan
    Kou, Ying-Xin
    Li, Zhan-Wu
    Xu, An
    INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2023, 2023
  • [6] Air Combat Maneuver Decision Based on Reinforcement Genetic Algorithm
    Xie J.
    Yang Q.
    Dai S.
    Wang W.
    Zhang J.
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2020, 38 (06): : 1330 - 1338
  • [7] Inspection Robot Navigation Based on Improved TD3 Algorithm
    Huang, Bo
    Xie, Jiacheng
    Yan, Jiawei
    SENSORS, 2024, 24 (08)
  • [8] Air combat maneuver decision-making based on improved symbiotic organisms search algorithm
    Gao Y.
    Yu M.
    Han Q.
    Dong X.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2019, 45 (03): : 429 - 436
  • [9] Application of the improved BAS-TIMS algorithm in air combat maneuver decision
    Ji H.
    Yu M.
    Qiao X.
    Yang H.
    Zhang S.
    Yang, Haiyan (lwzy1008@163.com), 1600, National University of Defense Technology (42): : 123 - 133
  • [10] UAV Air Combat Autonomous Maneuver Decision Based on DDPG Algorithm
    Yang, Qiming
    Zhu, Yan
    Zhang, Jiandong
    Qiao, Shasha
    Liu, Jieling
    2019 IEEE 15TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2019, : 37 - 42