A Real-Time and Optimal Hypersonic Entry Guidance Method Using Inverse Reinforcement Learning

Cited by: 2
Authors
Su, Linfeng [1]
Wang, Jinbo [1]
Chen, Hongbo [1]
Pezzella, Giuseppe
Affiliations
[1] Sun Yat Sen Univ, Sch Syst Sci & Engn, Guangzhou 510006, Peoples R China
Keywords
hypersonic entry; inverse reinforcement learning; few datasets; autonomous guidance; real-time optimal control; trajectory optimization; convex optimization
DOI
10.3390/aerospace10110948
Chinese Library Classification number
V [Aviation, Aerospace]
Discipline classification code
08; 0825
Abstract
Hypersonic vehicle missions involve highly nonlinear dynamics and complex environments, which challenge both the autonomy and the real-time performance of onboard guidance algorithms. In this paper, inverse reinforcement learning is used to address the hypersonic entry guidance problem. State-control and state-reward sample pairs, obtained by interacting with the hypersonic entry dynamics, are used to train the neural network with the distributed proximal policy optimization method. To overcome the sparse reward problem in hypersonic entry, a novel reward function combined with a sophisticated discriminator network is designed to generate dense optimal rewards continuously; this is the main contribution of the paper. The optimized guidance methodology achieves good terminal accuracy and high success rates using only a small number of trajectories as datasets, while satisfying heating rate, overload, and dynamic pressure constraints. The proposed guidance method is applied to two typical hypersonic entry vehicles (Common Aero Vehicle-Hypersonic and Reusable Launch Vehicle) to demonstrate its feasibility and potential. Numerical simulation results validate the real-time performance and optimality of the proposed method and indicate its suitability for onboard application in hypersonic entry flight.
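The abstract does not give the form of the reward function or the discriminator, so the following is only a minimal sketch of the general idea of turning a discriminator's output into a dense per-step reward, in the style of generative adversarial imitation learning. Everything in it is an assumption made for illustration: the `LinearDiscriminator` class, the logistic-regression model standing in for a neural network, the reward formula `-log(1 - D)`, and the synthetic 4-dimensional "state-control" samples. It is not the authors' implementation.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


class LinearDiscriminator:
    """Hypothetical stand-in for a discriminator network (logistic regression)."""

    def __init__(self, dim, lr=0.1):
        self.w = np.zeros(dim)
        self.b = 0.0
        self.lr = lr

    def prob_expert(self, x):
        # Estimated probability that a state-control pair came from the expert data.
        return sigmoid(x @ self.w + self.b)

    def update(self, expert_x, policy_x):
        # One gradient step on binary cross-entropy: expert -> 1, policy -> 0.
        x = np.vstack([expert_x, policy_x])
        y = np.concatenate([np.ones(len(expert_x)), np.zeros(len(policy_x))])
        err = self.prob_expert(x) - y
        self.w -= self.lr * (x.T @ err) / len(y)
        self.b -= self.lr * err.mean()

    def dense_reward(self, x, eps=1e-8):
        # Assumed GAIL-style surrogate reward, available at every time step.
        return -np.log(1.0 - self.prob_expert(x) + eps)


# Synthetic placeholder data (assumption): 4-dimensional state-control pairs.
rng = np.random.default_rng(0)
expert_pairs = rng.normal(loc=1.0, scale=0.5, size=(200, 4))
policy_pairs = rng.normal(loc=-1.0, scale=0.5, size=(200, 4))

disc = LinearDiscriminator(dim=4)
for _ in range(200):
    disc.update(expert_pairs, policy_pairs)

expert_reward = disc.dense_reward(expert_pairs).mean()
policy_reward = disc.dense_reward(policy_pairs).mean()
print(expert_reward > policy_reward)
```

In a setup like this, a policy optimizer such as proximal policy optimization would consume `dense_reward` at every step instead of waiting for a sparse terminal signal, and the discriminator would be retrained as the policy's samples change.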
Pages: 19
Related papers
50 in total
  • [21] Real-time Optimal Allocation for Uncertain Time Coupled Resource Based on Reinforcement Learning
    Huang, Qilong
    Yang, Li
    Chi, Cheng
    Kong, XiangGuang
    Zhou, Cangqi
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 1264 - 1269
  • [22] Real-time Energy Management of Microgrid Using Reinforcement Learning
    Bi, Wenzheng
    Shu, Yuankai
    Dong, Wei
    Yang, Qiang
    2020 19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS ENGINEERING AND SCIENCE (DCABES 2020), 2020, : 38 - 41
  • [23] Simulation of reinforcement learning based real-time guidance of proton therapy for mobile tumors
    Ghislain, Melanie
    Dasnoy, Damien
    Macq, Benoit
    RADIOTHERAPY AND ONCOLOGY, 2024, 194 : S4492 - S4495
  • [24] Real-time Guidance Strategy for Active Defense Aircraft via Deep Reinforcement Learning
    Li, Zhi
    Wu, Jinze
    Wu, Yuanpei
    Zheng, Yu
    Li, Meng
    Liang, Haizhao
    PROCEEDINGS OF THE 2021 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON), 2021, : 177 - 183
  • [25] Graph reinforcement learning for real-time optimal dispatch of active distribution network
    Chen J.-B.
    Yu T.
    Pan Z.-N.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2024, 41 (06) : 999 - 1008
  • [26] Reinforcement learning: A new technique for the real-time optimal control of hydraulic networks
    Wilson, G
    HYDROINFORMATICS '96, VOLS 1 AND 2, 1996, : 893 - 900
  • [28] Application of Reinforcement Learning for Real-Time Optimal Control of the Pellet Induration Process
    Biswas, Jayasree
    Goyal, Akash
    Selvanathan, Balaji
    Nistala, Sri Harsha
    Runkana, Venkataramana
    TRANSACTIONS OF THE INDIAN INSTITUTE OF METALS, 2022, 75 (10) : 2539 - 2546
  • [29] A real-time tracking controller for piezoelectric actuators based on reinforcement learning and inverse compensation
    Qin, Shijie
    Cheng, Long
    SUSTAINABLE CITIES AND SOCIETY, 2021, 69
  • [30] A Real-Time Reentry Guidance Method for Hypersonic Vehicles Based on a Time2vec and Transformer Network
    Song, Jia
    Tong, Xindi
    Xu, Xiaowei
    Zhao, Kai
    AEROSPACE, 2022, 9 (08)