A Real-Time and Optimal Hypersonic Entry Guidance Method Using Inverse Reinforcement Learning

Cited by: 2

Authors:

Su, Linfeng [1]

Wang, Jinbo [1]

Chen, Hongbo [1]

Pezzella, Giuseppe

Affiliations:

[1] Sun Yat Sen Univ, Sch Syst Sci & Engn, Guangzhou 510006, Peoples R China

Source:

AEROSPACE | 2023 / Vol. 10 / Issue 11

Keywords:

hypersonic entry; inverse reinforcement learning; few datasets; autonomous guidance; real-time optimal control; TRAJECTORY OPTIMIZATION; CONVEX-OPTIMIZATION;

DOI:

10.3390/aerospace10110948

Chinese Library Classification:

V [Aviation, Aerospace];

Subject classification codes:

08 ; 0825 ;

Abstract:

The mission of hypersonic vehicles faces the problem of highly nonlinear dynamics and complex environments, which presents challenges to the intelligent level and real-time performance of onboard guidance algorithms. In this paper, inverse reinforcement learning is used to address the hypersonic entry guidance problem. The state-control sample pairs and state-rewards sample pairs obtained by interacting with hypersonic entry dynamics are used to train the neural network by applying the distributed proximal policy optimization method. To overcome the sparse reward problem in the hypersonic entry problem, a novel reward function combined with a sophisticated discriminator network is designed to generate dense optimal rewards continuously, which is the main contribution of this paper. The optimized guidance methodology can achieve good terminal accuracy and high success rates with a small number of trajectories as datasets while satisfying heating rate, overload, and dynamic pressure constraints. The proposed guidance method is employed for two typical hypersonic entry vehicles (Common Aero Vehicle-Hypersonic and Reusable Launch Vehicle) to demonstrate the feasibility and potential. Numerical simulation results validate the real-time performance and optimality of the proposed method and indicate its suitability for onboard applications in the hypersonic entry flight.

Pages: 19

50 items in total

[21] Real-time Optimal Allocation for Uncertain Time Coupled Resource Based on Reinforcement Learning
Huang, Qilong
Yang, Li
Chi, Cheng
Kong, XiangGuang
Zhou, Cangqi
2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 1264 - 1269
[22] Real-time Energy Management of Microgrid Using Reinforcement Learning
Bi, Wenzheng
Shu, Yuankai
Dong, Wei
Yang, Qiang
2020 19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS ENGINEERING AND SCIENCE (DCABES 2020), 2020, : 38 - 41
[23] Simulation of reinforcement learning based real-time guidance of proton therapy for mobile tumors
Ghislain, Melanie
Dasnoy, Damien
Macq, Benoit
RADIOTHERAPY AND ONCOLOGY, 2024, 194 : S4492 - S4495
[24] Real-time Guidance Strategy for Active Defense Aircraft via Deep Reinforcement Learning
Li, Zhi
Wu, Jinze
Wu, Yuanpei
Zheng, Yu
Li, Meng
Liang, Haizhao
PROCEEDINGS OF THE 2021 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON), 2021, : 177 - 183
[25] Graph reinforcement learning for real-time optimal dispatch of active distribution network
Chen J.-B.
Yu T.
Pan Z.-N.
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2024, 41 (06): : 999 - 1008
[26] Reinforcement learning: A new technique for the real-time optimal control of hydraulic networks
Wilson, G
HYDROINFORMATICS '96, VOLS 1 AND 2, 1996, : 893 - 900
[27] Application of Reinforcement Learning for Real-Time Optimal Control of the Pellet Induration Process
Jayasree Biswas
Akash Goyal
Balaji Selvanathan
Sri Harsha Nistala
Venkataramana Runkana
Transactions of the Indian Institute of Metals, 2022, 75 : 2539 - 2546
[28] Application of Reinforcement Learning for Real-Time Optimal Control of the Pellet Induration Process
Biswas, Jayasree
Goyal, Akash
Selvanathan, Balaji
Nistala, Sri Harsha
Runkana, Venkataramana
TRANSACTIONS OF THE INDIAN INSTITUTE OF METALS, 2022, 75 (10) : 2539 - 2546
[29] A real-time tracking controller for piezoelectric actuators based on reinforcement learning and inverse compensation
Qin, Shijie
Cheng, Long
SUSTAINABLE CITIES AND SOCIETY, 2021, 69
[30] A Real-Time Reentry Guidance Method for Hypersonic Vehicles Based on a Time2vec and Transformer Network
Song, Jia
Tong, Xindi
Xu, Xiaowei
Zhao, Kai
AEROSPACE, 2022, 9 (08)
