Deep Reinforcement Learning-Based Decision Making for Six Degree of Freedom UCAV Close Range Air Combat

Cited by: 0

Authors:

Zhou, Pan [1]

Li, Ni [2]

Huang, Jiangtao [2]

Zhang, Sheng [2]

Zhou, Xiaoyu [2]

Liu, Gang [2]

Affiliations:

[1] Northwestern Polytech Univ, Sch Aeronaut, Xian, Peoples R China

[2] China Aerodynam Res & Dev Ctr, Inst Space Technol, Mianyang, Sichuan, Peoples R China

Source:

2023 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON AEROSPACE TECHNOLOGY, VOL II, APISAT 2023 | 2024 / Vol. 1051

Keywords:

Air combat; six-degree-of-freedom modeling; autonomous decision making; situation assessment; deep reinforcement learning;

DOI:

10.1007/978-981-97-4010-9_24

Chinese Library Classification number:

V [Aviation, Aerospace];

Subject classification code:

08 ; 0825 ;

Abstract:

With advances in computer science, automatic control, aircraft design, and related disciplines, artificial-intelligence-driven decision-making technology for Unmanned Combat Aerial Vehicle (UCAV) air combat has brought revolutionary changes to the theory and modes of air combat. To address autonomous decision making for six-degree-of-freedom UCAV close-range air combat, this paper proposes a UCAV air combat decision-making method based on deep reinforcement learning. First, a close-range air combat environment model is built on a six-degree-of-freedom UCAV model. Second, an autonomous decision-making model with multi-dimensional continuous state input and multi-dimensional continuous action output is established using a deep neural network; it receives the combat situation information and outputs the UCAV's joystick displacement commands. Then, a reward function that accounts for the missile attack zone and air combat orientation is designed, comprising an angle reward, a distance reward, and a height reward. On this basis, a twin delayed deep deterministic policy gradient algorithm is used to train the autonomous decision-making model. Finally, simulation experiments of a UCAV close-range air combat scenario show that the proposed intelligent air combat decision-making machine achieves a win rate 3.57 times higher than that of an expert system and an average situation reward 1.19 times higher than that of the enemy aircraft.
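
The abstract names the twin delayed deep deterministic policy gradient (TD3) algorithm but gives no implementation details. As a rough illustration only, the sketch below computes the standard TD3 critic target for one transition, using target policy smoothing and the clipped double-Q minimum. TD3's third ingredient, delayed policy updates, is not shown. Every name here (`td3_target`, `toy_actor`, `toy_q1`, `toy_q2`), the hyperparameter defaults, and the normalized action range of [-1, 1] for the joystick commands are assumptions made for this example, not values taken from the paper.

```python
import random


def clip(x, lo, hi):
    """Clamp a scalar to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def td3_target(reward, next_state, done, target_actor, target_q1, target_q2,
               gamma=0.99, noise_std=0.2, noise_clip=0.5,
               action_low=-1.0, action_high=1.0, rng=random):
    """Compute the TD3 critic target for a single transition.

    All defaults are illustrative assumptions, not the paper's settings.
    """
    # Target policy smoothing: add clipped Gaussian noise to each component
    # of the target action, then clamp to the valid action range.
    smoothed = []
    for a in target_actor(next_state):
        noise = clip(rng.gauss(0.0, noise_std), -noise_clip, noise_clip)
        smoothed.append(clip(a + noise, action_low, action_high))
    # Clipped double-Q: use the smaller of the two target critic estimates
    # to reduce overestimation bias.
    q_min = min(target_q1(next_state, smoothed),
                target_q2(next_state, smoothed))
    # No bootstrapping past a terminal transition.
    return reward + gamma * (0.0 if done else 1.0) * q_min


# Hypothetical stand-ins for the target networks, used only to run the sketch.
def toy_actor(state):
    return [0.5 * s for s in state[:2]]


def toy_q1(state, action):
    return sum(state) + sum(action)


def toy_q2(state, action):
    return sum(state) - sum(action)


if __name__ == "__main__":
    y = td3_target(1.0, [1.0, 2.0], False, toy_actor, toy_q1, toy_q2)
    print("TD3 target:", y)
```

In a full training loop this target would be regressed against by both critics, while the actor and the target networks are updated less frequently than the critics.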


Pages: 320-334

Number of pages: 15
