Deep Reinforcement Learning-Based Decision Making for Six Degree of Freedom UCAV Close Range Air Combat

被引:0
|
作者
Zhou, Pan [1 ]
Li, Ni [2 ]
Huang, Jiangtao [2 ]
Zhang, Sheng [2 ]
Zhou, Xiaoyu [2 ]
Liu, Gang [2 ]
机构
[1] Northwestern Polytech Univ, Sch Aeronaut, Xian, Peoples R China
[2] China Aerodynam Res & Dev Ctr, Inst Space Technol, Mianyang, Sichuan, Peoples R China
来源
2023 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON AEROSPACE TECHNOLOGY, VOL II, APISAT 2023 | 2024年 / 1051卷
关键词
Air combat; six-degree-of-freedom modeling; autonomous decision making; situation assessment; deep reinforcement learning;
D O I
10.1007/978-981-97-4010-9_24
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
With the development of computer science, automatic control, aircraft design and other disciplines, artificial intelligence-driven Unmanned Combat Aerial Vehicle (UCAV) air combat decision-making technology has brought revolutionary changes in air combat theory and mode. Aiming at the six-degree-of-freedom UCAV close-range air combat autonomous decision-making problem, this paper proposes aUCAVair combat decision-making method based on the deep reinforcement learning method. Firstly, a close-range air combat environment model based on the six-degree-of-freedom UCAV model is developed. Secondly, an autonomous decision-making model for the UCAV close-range air combat with multi-dimensional continuous state input and multi-dimensional continuous action output is established based on the deep neural network, which receives the combat situation information and outputs the UCAV's joystick displacement commands. Then, a reward function considering the missile attack zone and air combat orientation is designed, which includes the angle reward, the distance reward and the height reward. On this basis, a twin delayed deep deterministic policy gradient algorithm is employed to train the autonomous decision-making model for air combat. Finally, simulation experiments of the UCAV close-range air combat scenario are carried out, and the simulation results show that the proposed intelligent air combat decision-making machine has a win rate 3.57 times higher than that of an expert system, and occupies an average situation reward 1.19 times higher than that of the enemy aircraft.
引用
收藏
页码:320 / 334
页数:15
相关论文
共 50 条
  • [41] Research on Multi-aircraft Cooperative Air Combat Method Based on Deep Reinforcement Learning
    Shi W.
    Feng Y.-H.
    Cheng G.-Q.
    Huang H.-L.
    Huang J.-C.
    Liu Z.
    He W.
    Feng, Yang-He (fengyanghe@nudt.edu.cn), 1610, Science Press (47): : 1610 - 1623
  • [42] DEEP REINFORCEMENT LEARNING-BASED RATE ADAPTATION FOR ADAPTIVE 360-DEGREE VIDEO STREAMING
    Kan, Nuowen
    Zou, Junni
    Tang, Kexin
    Li, Chenglin
    Liu, Ning
    Xiong, Hongkai
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 4030 - 4034
  • [43] An Evolutionary Reinforcement Learning Approach for Autonomous Maneuver Decision in One-to-One Short-Range Air Combat
    Baykal, Yasin
    Baspinar, Baris
    2023 IEEE/AIAA 42ND DIGITAL AVIONICS SYSTEMS CONFERENCE, DASC, 2023,
  • [44] Deep Reinforcement Learning-Based Attention Decision Network for Agile Earth Observation Satellite Scheduling
    Liu, Dongning
    Zhou, Guanghui
    REMOTE SENSING, 2024, 16 (23)
  • [45] Deep reinforcement learning-based long-range autonomous valet parking for smart cities
    Khalid, Muhammad
    Wang, Liang
    Wang, Kezhi
    Aslam, Nauman
    Pan, Cunhua
    Cao, Yue
    SUSTAINABLE CITIES AND SOCIETY, 2023, 89
  • [46] Autonomous Decision Making of UAV in Short-Range Air Combat Based on DQN Aided by Expert Knowledge
    Hu, Tianmi
    Hu, Jinwen
    Zhao, Chunhui
    Pan, Quan
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1661 - 1670
  • [47] Decision-making Method for Transient Stability Emergency Control Based on Deep Reinforcement Learning
    Li H.
    Zhang P.
    Liu Z.
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2023, 47 (05): : 144 - 152
  • [48] Research on the Decision-Making Model of Carbon Quota Trading Based on Deep Reinforcement Learning
    Liu, Jiawen
    Huan, Jiajia
    Lan, Xiaodong
    INTERNATIONAL JOURNAL OF E-COLLABORATION, 2025, 21 (01)
  • [49] A UAV Maneuver Decision-Making Algorithm for Autonomous Airdrop Based on Deep Reinforcement Learning
    Li, Ke
    Zhang, Kun
    Zhang, Zhenchong
    Liu, Zekun
    Hua, Shuai
    He, Jianliang
    SENSORS, 2021, 21 (06)
  • [50] Network Defense Decision-Making Based on Deep Reinforcement Learning and Dynamic Game Theory
    Huang, Wanwei
    Yuan, Bo
    Wang, Sunan
    Ding, Yi
    Li, Yuhua
    CHINA COMMUNICATIONS, 2024, 21 (09) : 262 - 275