Curriculum learning-based missile guidance law for intercepting maneuvering targets with high-speed

被引:0
作者
Tao, He [1 ]
Jia, Zhengxuan [2 ]
Zhu, Bing [1 ]
Lin, Tingyu [2 ]
机构
[1] Beihang Univ, Res Div 7, Beijing 100191, Peoples R China
[2] Beijing Simulat Ctr, Beijing 100854, Peoples R China
基金
中国国家自然科学基金;
关键词
Missile guidance; Reinforcement learning; Curriculum learning; Maneuvering target; BIASED PNG LAW; SYSTEMS; IMPACT;
D O I
10.1016/j.engappai.2025.110948
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a missile guidance strategy is designed by using curriculum-based reinforcement learning for intercepting high-speed maneuvering targets. To enhance the exploration capabilities of agents within the state space, an exploration ability index is introduced. The proposed index, integrated with the entropy of the Soft Actor-Critic algorithm, dynamically adjusts the training process to augment exploration. In the proposed curriculum-based reinforcement learning, multiple sub-tasks are designed to progressively train the agent from simple to complex scenarios. This incremental learning framework enables the agent to efficiently master the task, expediting the overall training process. Through comprehensive simulations, we validate the efficacy of the proposed guidance law. Comparing the results obtained in other reinforcement learning methods, our findings highlight the advantages of curriculum learning in enhancing the efficiency and effectiveness of missile guidance systems.
引用
收藏
页数:12
相关论文
共 38 条
[1]  
Akkem Y., 2023, INDIAN J SCI TECHNOL, V16, P4688, DOI DOI 10.17485/IJST/v16i48.2850
[2]   A comprehensive review of synthetic data generation in smart farming by using variational autoencoder and generative adversarial network [J].
Akkem, Yaganteeswarudu ;
Biswas, Saroj Kumar ;
Varanasi, Aruna .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131
[3]   Smart farming using artificial intelligence: A review [J].
Akkem, Yaganteeswarudu ;
Biswas, Saroj Kumar ;
Varanasi, Aruna .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
[4]  
Bengio Y., 2009, P 26 ANN INT C MACH, P41, DOI DOI 10.1145/1553374.1553380
[5]   Three-dimensional cooperative guidance strategy and guidance law for intercepting highly maneuvering target [J].
Chen, Ziyan ;
Yu, Jianglong ;
Dong, Xiwang ;
Ren, Zhang .
CHINESE JOURNAL OF AERONAUTICS, 2021, 34 (05) :485-495
[6]   Reinforcement learning-based missile terminal guidance of maneuvering targets with decoys [J].
Deng, Tianbo ;
Huang, Hao ;
Fang, Yangwang ;
Yan, Jie ;
Cheng, Haoyu .
CHINESE JOURNAL OF AERONAUTICS, 2023, 36 (12) :309-324
[7]  
Florensa C, 2018, PR MACH LEARN RES, V80
[8]  
Florensa C, 2017, PR MACH LEARN RES, V78
[9]   Adaptive guidance and integrated navigation with reinforcement meta-learning [J].
Gaudet, Brian ;
Linares, Richard ;
Furfaro, Roberto .
ACTA ASTRONAUTICA, 2020, 169 :180-190
[10]   Reinforcement learning for angle-only intercept guidance of maneuvering targets [J].
Gaudet, Brian ;
Furfaro, Roberto ;
Linares, Richard .
AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 99