Deep Reinforcement Learning-Based Decision Making for Six Degree of Freedom UCAV Close Range Air Combat

被引：0

作者：

Zhou, Pan ^{[1
]}

Li, Ni ^{[2
]}

Huang, Jiangtao ^{[2
]}

Zhang, Sheng ^{[2
]}

Zhou, Xiaoyu ^{[2
]}

Liu, Gang ^{[2
]}

机构：

[1] Northwestern Polytech Univ, Sch Aeronaut, Xian, Peoples R China

[2] China Aerodynam Res & Dev Ctr, Inst Space Technol, Mianyang, Sichuan, Peoples R China

来源：

2023 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON AEROSPACE TECHNOLOGY, VOL II, APISAT 2023 | 2024年 / 1051卷

关键词：

Air combat; six-degree-of-freedom modeling; autonomous decision making; situation assessment; deep reinforcement learning;

D O I：

10.1007/978-981-97-4010-9_24

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

With the development of computer science, automatic control, aircraft design and other disciplines, artificial intelligence-driven Unmanned Combat Aerial Vehicle (UCAV) air combat decision-making technology has brought revolutionary changes in air combat theory and mode. Aiming at the six-degree-of-freedom UCAV close-range air combat autonomous decision-making problem, this paper proposes aUCAVair combat decision-making method based on the deep reinforcement learning method. Firstly, a close-range air combat environment model based on the six-degree-of-freedom UCAV model is developed. Secondly, an autonomous decision-making model for the UCAV close-range air combat with multi-dimensional continuous state input and multi-dimensional continuous action output is established based on the deep neural network, which receives the combat situation information and outputs the UCAV's joystick displacement commands. Then, a reward function considering the missile attack zone and air combat orientation is designed, which includes the angle reward, the distance reward and the height reward. On this basis, a twin delayed deep deterministic policy gradient algorithm is employed to train the autonomous decision-making model for air combat. Finally, simulation experiments of the UCAV close-range air combat scenario are carried out, and the simulation results show that the proposed intelligent air combat decision-making machine has a win rate 3.57 times higher than that of an expert system, and occupies an average situation reward 1.19 times higher than that of the enemy aircraft.

引用

页码：320 / 334

页数：15

共 50 条

[41] Research on Multi-aircraft Cooperative Air Combat Method Based on Deep Reinforcement Learning
Shi W.
Feng Y.-H.
Cheng G.-Q.
Huang H.-L.
Huang J.-C.
Liu Z.
He W.
Feng, Yang-He (fengyanghe@nudt.edu.cn), 1610, Science Press (47): : 1610 - 1623
[42] DEEP REINFORCEMENT LEARNING-BASED RATE ADAPTATION FOR ADAPTIVE 360-DEGREE VIDEO STREAMING
Kan, Nuowen
Zou, Junni
Tang, Kexin
Li, Chenglin
Liu, Ning
Xiong, Hongkai
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 4030 - 4034
[43] An Evolutionary Reinforcement Learning Approach for Autonomous Maneuver Decision in One-to-One Short-Range Air Combat
Baykal, Yasin
Baspinar, Baris
2023 IEEE/AIAA 42ND DIGITAL AVIONICS SYSTEMS CONFERENCE, DASC, 2023,
[44] Deep Reinforcement Learning-Based Attention Decision Network for Agile Earth Observation Satellite Scheduling
Liu, Dongning
Zhou, Guanghui
REMOTE SENSING, 2024, 16 (23)
[45] Deep reinforcement learning-based long-range autonomous valet parking for smart cities
Khalid, Muhammad
Wang, Liang
Wang, Kezhi
Aslam, Nauman
Pan, Cunhua
Cao, Yue
SUSTAINABLE CITIES AND SOCIETY, 2023, 89
[46] Autonomous Decision Making of UAV in Short-Range Air Combat Based on DQN Aided by Expert Knowledge
Hu, Tianmi
Hu, Jinwen
Zhao, Chunhui
Pan, Quan
PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1661 - 1670
[47] Decision-making Method for Transient Stability Emergency Control Based on Deep Reinforcement Learning
Li H.
Zhang P.
Liu Z.
Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2023, 47 (05): : 144 - 152
[48] Research on the Decision-Making Model of Carbon Quota Trading Based on Deep Reinforcement Learning
Liu, Jiawen
Huan, Jiajia
Lan, Xiaodong
INTERNATIONAL JOURNAL OF E-COLLABORATION, 2025, 21 (01)
[49] A UAV Maneuver Decision-Making Algorithm for Autonomous Airdrop Based on Deep Reinforcement Learning
Li, Ke
Zhang, Kun
Zhang, Zhenchong
Liu, Zekun
Hua, Shuai
He, Jianliang
SENSORS, 2021, 21 (06)
[50] Network Defense Decision-Making Based on Deep Reinforcement Learning and Dynamic Game Theory
Huang, Wanwei
Yuan, Bo
Wang, Sunan
Ding, Yi
Li, Yuhua
CHINA COMMUNICATIONS, 2024, 21 (09) : 262 - 275

← 1 2 3 4 5 →