Curriculum-RL Based Air Combat Decision-Making

被引：1

作者：

He, Yuhang ^{[1
]}

Yang, Dapeng ^{[2
]}

Zhang, Man ^{[2
]}

Li, Yan ^{[1
]}

机构：

[1] Northwestern Polytech Univ, Dept Nav Guidance & Control, Xian 710129, Peoples R China

[2] Avic Shenyang Aircraft Design & Res Inst, Shenyang 110035, Peoples R China

来源：

ADVANCES IN GUIDANCE, NAVIGATION AND CONTROL | 2023年 / 845卷

关键词：

Deep reinforcement learning; Air combat; Unmanned aerial vehicles;

D O I：

10.1007/978-981-19-6613-2_447

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep reinforcement learning has been able to solve a wide range of complex decision-making tasks that were previously out of reach for a machine. But there are many challenges in using reinforcement learning to solve air combat decision-making problems. One of them is that rewards are sparse in the air combat and most of today's DRL algorithms struggle with such sparsity. Therefore, we propose a new method called Curriculum-RL which guide agent to learn a task from the simple to the complex. While solving the sparse problem, Curriculum-RL helps the agent to learn the air combat strategy better than the general weighted reward structure. In the end, we test our approach in the two-plane air combat and analyze the result.

引用

页码：4611 / 4621

页数：11

共 8 条

[1]

Bansal T, 2018, Arxiv, DOI [arXiv:1710.03748, 10.48550/arXiv.1710.03748, DOI 10.48550/ARXIV.1710.03748]

[2]

Defense advanced research projects agency, 2020, Alphadogfight trial go virtual for final event EB/OL

[3]

Liu Pin., 2017, Communications in Computer and Information Science, P274, DOI DOI 10.1007/978-981-10-6463-024

[4]

Mcgrew J.S., 2009, Master's The-sis Mit

[5]

Pope Adrian P., 2021, 2021 International Conference on Unmanned Aircraft Systems (ICUAS), P275, DOI 10.1109/ICUAS51884.2021.9476700

[6]

Schulman J, 2017, Arxiv, DOI [arXiv:1707.06347, DOI 10.48550/ARXIV.1707.06347]

[7]

YANG CD, 1998, T AERONAUTICAL ASTRO, V30, P149

[8] Hierarchical fuzzy ART for Q-learning and its application in air combat simulation [J].

Zhou Y. ;

Ma Y. ;

Song X. ;

Gong G. .

International Journal of Modeling, Simulation, and Scientific Computing, 2017, 8 (04)

← 1 →