All-aspect attack guidance law for agile missiles based on deep reinforcement learning

Cited by: 18
Authors
Gong, Xiaopeng [1]
Chen, Wanchun [1]
Chen, Zhongyuan [1]
Affiliations
[1] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China
Funding
China Postdoctoral Science Foundation;
Keywords
Deep reinforcement learning; Agile turn; Angle-of-attack guidance law; Hierarchical structure; All-aspect attack; High angle-of-attack; AUTOPILOT DESIGN;
DOI
10.1016/j.ast.2022.107677
Chinese Library Classification number
V [Aeronautics, Astronautics];
Discipline classification code
08; 0825;
Abstract
This paper presents an all-aspect attack guidance law for agile missiles based on deep reinforcement learning (DRL), which can effectively cope with the aerodynamic uncertainty and strong nonlinearity in the high angle-of-attack (AOA) flight phase. First, to make the training environment more authentic, the full flight envelope of the missile is modeled and highly accurate aerodynamic data is obtained using the computational fluid dynamics (CFD) technique. Subsequently, the DRL algorithm is applied to generate an AOA guidance law for the agile turn phase. A hierarchical scheme is designed that consists of a meta-controller for real-time decision making according to the combat scenario and a sub-controller for generating guidance commands. This scheme enables the guidance law to cover the whole engagement and ensures that training converges in the agile turn phase. Considering the current limitations of missile maneuverability, two agile turn guidance laws are developed to accommodate both limited and unlimited AOA scenarios. Moreover, the proposed guidance law has excellent generalization capability and supports static training and dynamic execution, which means that the missile can adapt to realistic combat scenarios that were not encountered during training. Simulation results indicate that the DRL-based guidance law is nearly optimal and robust to disturbances. In addition, the proposed guidance law enables the missile to track time-varying desired turn angles to lock onto a maneuvering target in the rear hemisphere during the agile turn phase, providing advantageous initial conditions for the terminal guidance. Furthermore, the computational efficiency is high enough to satisfy the requirements of onboard application. © 2022 Elsevier Masson SAS. All rights reserved.
Pages: 18
Related papers
54 in total
  • [21] Autonomous air combat maneuver decision using Bayesian inference and moving horizon optimization
    Huang Changqiang
    Dong Kangsheng
    Huang Hanqiao
    Tang Shangqin
    Zhang Zhuoran
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2018, 29 (01) : 86 - 97
  • [22] Huang SY, 2022, Arxiv, DOI [arXiv:2006.14171, DOI 10.32473/FLAIRS.V35I.130584]
  • [23] Nonlinear guidance techniques for agile missiles
    Innocenti, M
    [J]. CONTROL ENGINEERING PRACTICE, 2001, 9 (10) : 1131 - 1144
  • [24] Path planning for asteroid hopping rovers with pre-trained deep reinforcement learning architectures
    Jiang, Jianxun
    Zeng, Xiangyuan
    Guzzetti, Davide
    You, Yuyang
    [J]. ACTA ASTRONAUTICA, 2020, 171 : 265 - 279
  • [25] Jun B, 2012, P 28 C INT COUNC AER
  • [26] Kim K.U., 2010, AIAA GUIDANCE NAVIGA, P1
  • [27] Pitch Autopilot Design for Agile Missiles with Uncertain Aerodynamic Coefficients
    Kim, Yoonsoo
    Kim, Byoung Soo
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2013, 49 (02) : 907 - 914
  • [28] Kirk Robert, 2021, arXiv
  • [29] Autonomous closed-loop guidance using reinforcement learning in a low-thrust, multi-body dynamical environment
    LaFarge, Nicholas B.
    Miller, Daniel
    Howell, Kathleen C.
    Linares, Richard
    [J]. ACTA ASTRONAUTICA, 2021, 186 : 1 - 23
  • [30] Agile Missile Autopilot Design using Nonlinear Backstepping Control with Time-Delay Adaptation
    Lee, Chang-Hun
    Kim, Tae-Hun
    Tahk, Min-Jea
    [J]. TRANSACTIONS OF THE JAPAN SOCIETY FOR AERONAUTICAL AND SPACE SCIENCES, 2014, 57 (01) : 9 - 20