Intelligent guidance for no-fly zone avoidance based on reinforcement learning

Cited by: 0
Authors
Hui J. [1]
Wang R. [2]
Guo J. [1]
Affiliations
[1] School of Astronautics, Harbin Institute of Technology, Harbin
[2] China Academy of Aerospace Science and Innovation, Beijing
Source
Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica | 2023 / Vol. 44 / No. 11
Keywords
intelligent guidance; no-fly zone avoidance; PPO algorithm; reinforcement learning; supervised learning
DOI
10.7527/S1000-6893.2022.27416
Abstract
The rapid development of artificial intelligence (AI) provides a new technical approach to aircraft guidance research. To address the problem of a reentry aircraft avoiding uncertain no-fly zones, we propose a research framework that chains three stages: predictor-corrector guidance, pre-training of a bank angle guidance model based on supervised learning, and further training of that model based on reinforcement learning. First, a large number of no-fly zone avoidance trajectories are generated by predictor-corrector guidance, and the bank angle guidance model is pre-trained on them with a supervised learning algorithm. Second, the bank angle guidance model is further trained with the Proximal Policy Optimization (PPO) algorithm through a large number of exploratory interactions between the aircraft and an environment containing uncertain no-fly zones. At the same time, an effective reward is used to exploit the strong lateral maneuverability of a reentry aircraft with a high lift-to-drag ratio. This approach removes the restriction to the bank angle solution space produced by predictor-corrector guidance and is therefore expected to yield better no-fly zone avoidance strategies. Comparison with traditional predictor-corrector guidance and with intelligent guidance based on supervised learning verifies that intelligent no-fly zone avoidance guidance based on reinforcement learning can fully exploit the wide-area flight advantages of the aircraft, and thus meet the adaptability requirements of future intelligent decision systems under uncertain scenarios. © 2023 AAAS Press of Chinese Society of Aeronautics and Astronautics. All rights reserved.
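The two training stages named in the abstract can be illustrated with a minimal sketch. The abstract does not state the loss functions used, so the following is an assumption: the supervised stage is shown as a mean squared error between the model's bank angle and the predictor-corrector bank angle, and the reinforcement learning stage is shown with the standard PPO clipped surrogate objective. The function names and the clipping value of 0.2 are hypothetical choices for illustration, not taken from the paper.

```python
import math


def behavior_cloning_loss(pred_bank, expert_bank):
    """Stage 1 (assumed form): mean squared error between the bank angles
    output by the model and those produced by predictor-corrector guidance."""
    return sum((p - e) ** 2 for p, e in zip(pred_bank, expert_bank)) / len(pred_bank)


def ppo_clip_objective(logp_new, logp_old, advantages, clip_eps=0.2):
    """Stage 2: standard PPO clipped surrogate objective (to be maximized).

    logp_new / logp_old are log-probabilities of the taken actions under the
    updated and the data-collecting policy; clip_eps is an assumed value.
    """
    total = 0.0
    for ln, lo, adv in zip(logp_new, logp_old, advantages):
        ratio = math.exp(ln - lo)  # probability ratio between the two policies
        clipped_ratio = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Taking the minimum removes any incentive to move the ratio
        # outside the interval [1 - clip_eps, 1 + clip_eps].
        total += min(ratio * adv, clipped_ratio * adv)
    return total / len(advantages)
```

In such a setup, the policy pre-trained with the first loss would serve as the starting point for updates driven by the second objective, which is what allows the learned bank angle commands to leave the solution space of the predictor-corrector demonstrations.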