Deep Reinforcement Learning-Based Decision Making for Six Degree of Freedom UCAV Close Range Air Combat

被引:0
|
作者
Zhou, Pan [1 ]
Li, Ni [2 ]
Huang, Jiangtao [2 ]
Zhang, Sheng [2 ]
Zhou, Xiaoyu [2 ]
Liu, Gang [2 ]
机构
[1] Northwestern Polytech Univ, Sch Aeronaut, Xian, Peoples R China
[2] China Aerodynam Res & Dev Ctr, Inst Space Technol, Mianyang, Sichuan, Peoples R China
来源
2023 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON AEROSPACE TECHNOLOGY, VOL II, APISAT 2023 | 2024年 / 1051卷
关键词
Air combat; six-degree-of-freedom modeling; autonomous decision making; situation assessment; deep reinforcement learning;
D O I
10.1007/978-981-97-4010-9_24
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
With the development of computer science, automatic control, aircraft design and other disciplines, artificial intelligence-driven Unmanned Combat Aerial Vehicle (UCAV) air combat decision-making technology has brought revolutionary changes in air combat theory and mode. Aiming at the six-degree-of-freedom UCAV close-range air combat autonomous decision-making problem, this paper proposes aUCAVair combat decision-making method based on the deep reinforcement learning method. Firstly, a close-range air combat environment model based on the six-degree-of-freedom UCAV model is developed. Secondly, an autonomous decision-making model for the UCAV close-range air combat with multi-dimensional continuous state input and multi-dimensional continuous action output is established based on the deep neural network, which receives the combat situation information and outputs the UCAV's joystick displacement commands. Then, a reward function considering the missile attack zone and air combat orientation is designed, which includes the angle reward, the distance reward and the height reward. On this basis, a twin delayed deep deterministic policy gradient algorithm is employed to train the autonomous decision-making model for air combat. Finally, simulation experiments of the UCAV close-range air combat scenario are carried out, and the simulation results show that the proposed intelligent air combat decision-making machine has a win rate 3.57 times higher than that of an expert system, and occupies an average situation reward 1.19 times higher than that of the enemy aircraft.
引用
收藏
页码:320 / 334
页数:15
相关论文
共 50 条
  • [21] Learning and Fast Adaptation for Air Combat Decision with Improved Deep Meta-reinforcement Learning
    Zhang, Pin
    Dong, Wenhan
    Cai, Ming
    Li, Dunwang
    Zhang, Xin
    INTERNATIONAL JOURNAL OF AERONAUTICAL AND SPACE SCIENCES, 2024,
  • [22] Deep Reinforcement Learning-Based Decision Making of Lane Change Considering Rear Vehicle Deceleration
    Jo G.-H.
    Park T.-H.
    Journal of Institute of Control, Robotics and Systems, 2022, 28 (06) : 602 - 607
  • [23] Deep Reinforcement-Learning-Based Air-Combat-Maneuver Generation Framework
    Mei, Junru
    Li, Ge
    Huang, Hesong
    MATHEMATICS, 2024, 12 (19)
  • [24] Maneuver Strategy Generation of UCAV for within Visual Range Air Combat Based on Multi-Agent Reinforcement Learning and Target Position Prediction
    Kong, Weiren
    Zhou, Deyun
    Yang, Zhen
    Zhang, Kai
    Zeng, Lina
    APPLIED SCIENCES-BASEL, 2020, 10 (15):
  • [25] Autonomous air combat decision-making of UAV based on parallel self-play reinforcement learning
    Li, Bo
    Huang, Jingyi
    Bai, Shuangxia
    Gan, Zhigang
    Liang, Shiyang
    Evgeny, Neretin
    Yao, Shouwen
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (01) : 64 - 81
  • [26] Application of Deep Reinforcement Learning in Maneuver Planning of Beyond-Visual-Range Air Combat
    Hu, Dongyuan
    Yang, Rennong
    Zuo, Jialiang
    Zhang, Ze
    Wu, Jun
    Wang, Ying
    IEEE ACCESS, 2021, 9 : 32282 - 32297
  • [27] Deep reinforcement learning based decision making for radar jamming suppression
    Xiao, Yihan
    Cao, Zongheng
    Yu, Xiangzhen
    Jiang, Yilin
    DIGITAL SIGNAL PROCESSING, 2024, 151
  • [28] Deep Reinforcement Learning Based Decision Making for Complex Jamming Waveforms
    Xu, Yuting
    Wang, Chao
    Liang, Jiakai
    Yue, Keqiang
    Li, Wenjun
    Zheng, Shilian
    Zhao, Zhijin
    ENTROPY, 2022, 24 (10)
  • [29] Autonomous maneuver decision-making for a UCAV in short-range aerial combat based on an MS-DDQN algorithm
    Li, Yong-feng
    Shi, Jing-ping
    Jiang, Wei
    Zhang, Wei-guo
    Lyu, Yong-xi
    DEFENCE TECHNOLOGY, 2022, 18 (09) : 1697 - 1714
  • [30] Single degree of freedom control based on deep reinforcement learning for underwater unmanned vehicle
    Li, Nan
    Zhao, Changming
    Shi, Yan
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2024,