Curriculum-RL Based Air Combat Decision-Making

被引:1
作者
He, Yuhang [1 ]
Yang, Dapeng [2 ]
Zhang, Man [2 ]
Li, Yan [1 ]
机构
[1] Northwestern Polytech Univ, Dept Nav Guidance & Control, Xian 710129, Peoples R China
[2] Avic Shenyang Aircraft Design & Res Inst, Shenyang 110035, Peoples R China
来源
ADVANCES IN GUIDANCE, NAVIGATION AND CONTROL | 2023年 / 845卷
关键词
Deep reinforcement learning; Air combat; Unmanned aerial vehicles;
D O I
10.1007/978-981-19-6613-2_447
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep reinforcement learning has been able to solve a wide range of complex decision-making tasks that were previously out of reach for a machine. But there are many challenges in using reinforcement learning to solve air combat decision-making problems. One of them is that rewards are sparse in the air combat and most of today's DRL algorithms struggle with such sparsity. Therefore, we propose a new method called Curriculum-RL which guide agent to learn a task from the simple to the complex. While solving the sparse problem, Curriculum-RL helps the agent to learn the air combat strategy better than the general weighted reward structure. In the end, we test our approach in the two-plane air combat and analyze the result.
引用
收藏
页码:4611 / 4621
页数:11
相关论文
共 8 条
[1]  
Bansal T, 2018, Arxiv, DOI [arXiv:1710.03748, 10.48550/arXiv.1710.03748, DOI 10.48550/ARXIV.1710.03748]
[2]  
Defense advanced research projects agency, 2020, Alphadogfight trial go virtual for final event EB/OL
[3]  
Liu Pin., 2017, Communications in Computer and Information Science, P274, DOI DOI 10.1007/978-981-10-6463-024
[4]  
Mcgrew J.S., 2009, Master's The-sis Mit
[5]  
Pope Adrian P., 2021, 2021 International Conference on Unmanned Aircraft Systems (ICUAS), P275, DOI 10.1109/ICUAS51884.2021.9476700
[6]  
Schulman J, 2017, Arxiv, DOI [arXiv:1707.06347, DOI 10.48550/ARXIV.1707.06347]
[7]  
YANG CD, 1998, T AERONAUTICAL ASTRO, V30, P149
[8]   Hierarchical fuzzy ART for Q-learning and its application in air combat simulation [J].
Zhou Y. ;
Ma Y. ;
Song X. ;
Gong G. .
International Journal of Modeling, Simulation, and Scientific Computing, 2017, 8 (04)