Terminal Adaptive Guidance for Autonomous Hypersonic Strike Weapons via Reinforcement Metalearning

被引:4
作者
Gaudet, Brian [1 ]
Furfaro, Roberto [1 ,2 ]
机构
[1] Univ Arizona, Dept Syst & Ind Engn, 1127 E Roger Way, Tucson, AZ 85721 USA
[2] Univ Arizona, Dept Aerosp & Mech Engn, 1127 E Roger Way, Tucson, AZ 85721 USA
关键词
VEHICLES; IMPACT; PHASE;
D O I
10.2514/1.A35396
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
An adaptive guidance system suitable for the terminal phase trajectory of a hypersonic strike weapon is optimized using reinforcement meta learning. The guidance system maps observations directly to commanded bank angle, angle of attack, and sideslip angle rates. Importantly, the observations are directly measurable from radar seeker outputs with minimal processing. The optimization framework implements a shaping reward that minimizes the line-of-sight rotation rate, with a terminal reward given if the agent satisfies path constraints and meets terminal accuracy and speed criteria. This paper shows that the guidance system can adapt to off-nominal flight conditions, including perturbation of aerodynamic coefficient parameters, actuator failure scenarios, sensor scale factor errors, and actuator lag, while satisfying heating rate, dynamic pressure, and load path constraints, as well as a minimum impact speed constraint. This paper demonstrates precision strike capability against a maneuvering ground target and the ability to divert to a new target, the latter being important to maximize strike effectiveness for a group of hypersonic strike weapons. Moreover, this paper demonstrates a threat evasion strategy against interceptors with limited midcourse correction capability, where the hypersonic strike weapon implements multiple diverts to alternate targets, with the last divert to the actual target.
引用
收藏
页码:286 / 298
页数:13
相关论文
共 27 条
  • [1] Chung JY, 2015, PR MACH LEARN RES, V37, P2067
  • [2] Nonlinear Ten-Degree-of-Freedom Dynamics Model of a Generic Hypersonic Vehicle
    Colgren, Richard
    Keshmiri, Shahriar
    Mirmirani, Maj
    [J]. JOURNAL OF AIRCRAFT, 2009, 46 (03): : 800 - 813
  • [3] Finn C, 2017, PR MACH LEARN RES, V70
  • [4] Frans K., 2017, INT C LEARN REPR
  • [5] Gaudet B., 2022, P AIAA SCITECH 2022
  • [6] Gaudet B., 2021, ARXIV PREPRINT ARXIV
  • [7] Reinforcement Metalearning for Interception of Maneuvering Exoatmospheric Targets with Parasitic Attitude Loop
    Gaudet, Brian
    Furfaro, Roberto
    Linares, Richard
    Scorsoglio, Andrea
    [J]. JOURNAL OF SPACECRAFT AND ROCKETS, 2021, 58 (02) : 386 - 399
  • [8] Deep reinforcement learning for six degree-of-freedom planetary landing
    Gaudet, Brian
    Linares, Richard
    Furfaro, Roberto
    [J]. ADVANCES IN SPACE RESEARCH, 2020, 65 (07) : 1723 - 1741
  • [9] Six degree-of-freedom body-fixed hovering over unmapped asteroids via LIDAR altimetry and reinforcement meta-learning
    Gaudet, Brian
    Linares, Richard
    Furfaro, Roberto
    [J]. ACTA ASTRONAUTICA, 2020, 172 : 90 - 99
  • [10] Terminal adaptive guidance via reinforcement meta-learning: Applications to autonomous asteroid close-proximity operations
    Gaudet, Brian
    Linares, Richard
    Furfaro, Roberto
    [J]. ACTA ASTRONAUTICA, 2020, 171 : 1 - 13