All-aspect attack guidance law for agile missiles based on deep reinforcement learning

被引：18

作者：

Gong, Xiaopeng ^{[1
]}

Chen, Wanchun ^{[1
]}

Chen, Zhongyuan ^{[1
]}

机构：

[1] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China

来源：

AEROSPACE SCIENCE AND TECHNOLOGY | 2022年 / 127卷

基金：

中国博士后科学基金;

关键词：

Deep reinforcement learning; Agile turn; Angle-of-attack guidance law; Hierarchical structure; All-aspect attack; High angle-of-attack; AUTOPILOT DESIGN;

D O I：

10.1016/j.ast.2022.107677

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

This paper presents an all-aspect attack guidance law for agile missiles based on deep reinforcement learning (DRL), which can effectively cope with the aerodynamic uncertainty and strong nonlinearity in the high angle-of-attack (AOA) flight phase. First, to make the training environment more authentic, the full flight envelope of the missile is modeled and highly accurate aerodynamic data is obtained through Computational Fluid Dynamics (CFD) technique. Subsequently, the DRL algorithm is applied to generate an AOA guidance law for the agile turn phase. A hierarchical scheme that consists of a meta-controller for real-time decision making according to combat scenario and a sub-controller for generating guidance command is designed, which enables the guidance law to cover the whole process of the engagement and ensures the convergence of the training in the agile turn phase. Considering the current limitations of missile maneuverability, two agile turn guidance laws are developed to accommodate both limited and unlimited AOA scenarios. Moreover, the proposed guidance law has excellent generalization capability and ensures the implementation of static training and dynamic execution, which means that the missile can adapt to the realistic combat scenarios that have not been encountered during the training. Simulation results indicate that the DRL-based guidance law is nearly optimal and robust to disturbances. In addition, the proposed guidance law enables the missile to track time-varying desired turn angles to lock the maneuvering target in the rear hemisphere during the agile turn phase, providing advantageous initial conditions for the terminal guidance. Furthermore, the computational efficiency is high enough to satisfy the requirement on onboard application. (C) 2022 Elsevier Masson SAS. All rights reserved.

引用

页数：18

共 54 条

[21] Autonomous air combat maneuver decision using Bayesian inference and moving horizon optimization
Huang Changqiang
Dong Kangsheng
Huang Hanqiao
Tang Shangqin
Zhang Zhuoran
[J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2018, 29 (01) : 86 - 97
[22] Huang SY, 2022, Arxiv, DOI [arXiv:2006.14171, DOI 10.32473/FLAIRS.V35I.130584]
[23] Nonlinear guidance techniques for agile missiles
Innocenti, M
[J]. CONTROL ENGINEERING PRACTICE, 2001, 9 (10) : 1131 - 1144
[24] Path planning for asteroid hopping rovers with pre-trained deep reinforcement learning architectures
Jiang, Jianxun
Zeng, Xiangyuan
Guzzetti, Davide
You, Yuyang
[J]. ACTA ASTRONAUTICA, 2020, 171 : 265 - 279
[25] Jun B, 2012, P 28 C INT COUNC AER
[26] Kim K.U., 2010, AIAA GUIDANCE NAVIGA, P1
[27] Pitch Autopilot Design for Agile Missiles with Uncertain Aerodynamic Coefficients
Kim, Yoonsoo
Kim, Byoung Soo
[J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2013, 49 (02) : 907 - 914
[28] Kirk Robert, 2021, arXiv
[29] Autonomous closed-loop guidance using reinforcement learning in a low-thrust, multi-body dynamical environment
LaFarge, Nicholas B.
Miller, Daniel
Howell, Kathleen C.
Linares, Richard
[J]. ACTA ASTRONAUTICA, 2021, 186 : 1 - 23
[30] Agile Missile Autopilot Design using Nonlinear Backstepping Control with Time-Delay Adaptation
Lee, Chang-Hun
Kim, Tae-Hun
Tahk, Min-Jea
[J]. TRANSACTIONS OF THE JAPAN SOCIETY FOR AERONAUTICAL AND SPACE SCIENCES, 2014, 57 (01) : 9 - 20

← 1 2 3 4 5 6 →