Deep reinforcement learning based trajectory real-time planning for hypersonic gliding vehicles

被引:0
|
作者
Li, Jianfeng [1 ]
Song, Shenmin [1 ]
Shi, Xiaoping [2 ]
机构
[1] Harbin Inst Technol, Ctr Control Theory & Guidance Technol, 92 West Dazhi St, Harbin 150001, Peoples R China
[2] Harbin Inst Technol, Control & Simulat Ctr, Harbin, Peoples R China
关键词
Deep reinforcement learning; hypersonic gliding vehicle; trajectory planning; TD3; miss distance; PARTICLE SWARM OPTIMIZATION; ENTRY GUIDANCE; UAV;
D O I
10.1177/09544100241278023
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
To overcome the shortcomings of traditional NLP methods for trajectory planning problems, an intelligent trajectory real-time planning method is designed for hypersonic gliding vehicles (HGVs), which is composed of two stages: the agent training stage and the real-time trajectory generation stage. During the training stage, the HGV model is considered as an agent, and an environment containing flight information and relative information is constructed. Given the trajectory planning problem possessing continuous state-action space, the twin delayed deep deterministic policy gradient (TD3) is employed, based on which the HGV agent is trained in the environment. To match the real flight environment for HGVs, the process and terminal constraints are taken into consideration, such as the limit of dynamic pressure, overload, and the terminal miss distance, etc. The reward shaping technique is adopted to tackle the multiple constraints. The second stage is the real-time trajectory generation stage, during which a trajectory satisfying the multiple constraints is generated online by the TD3-based method. The simulation results verify the effectiveness of the proposed method.
引用
收藏
页码:1665 / 1682
页数:18
相关论文
共 50 条
  • [1] Trajectory Planning for Hypersonic Vehicles with Reinforcement Learning
    Chi, Haihong
    Thou, Mingxin
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 3721 - 3726
  • [2] Deep Reinforcement Learning for Real-Time Trajectory Planning in UAV Networks
    Li, Kai
    Ni, Wei
    Tovar, Eduardo
    Guizani, Mohsen
    2020 16TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC, 2020, : 958 - 963
  • [3] A Real-Time Trajectory Optimization Method for Hypersonic Vehicles Based on a Deep Neural Network
    Wang, Jianying
    Wu, Yuanpei
    Liu, Ming
    Yang, Ming
    Liang, Haizhao
    AEROSPACE, 2022, 9 (04)
  • [4] Real-time Dispatch Strategy for Electric Vehicles Based on Deep Reinforcement Learning
    Li H.
    Li G.
    Wang K.
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2020, 44 (22): : 161 - 167
  • [5] A deep reinforcement learning-based approach to onboard trajectory generation for hypersonic vehicles
    Bao, C. Y.
    Zhou, X.
    Wang, P.
    He, R. Z.
    Tang, G. J.
    AERONAUTICAL JOURNAL, 2023, 127 (1315): : 1638 - 1658
  • [6] Reinforcement Learning-Based 3D Trajectory Tracking Control of Hypersonic Gliding Vehicles With Time-Varying Uncertainties
    Luo, Biao
    Sun, Jingyi
    Tang, Rui
    Xu, Xiaodong
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 8187 - 8199
  • [7] Real-Time Battery Thermal Management for Electric Vehicles Based on Deep Reinforcement Learning
    Huang, Gan
    Zhao, Ping
    Zhang, Guanglin
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (15) : 14060 - 14072
  • [8] Real-Time Planning of Route, Speed, and Charging for Electric Delivery Vehicles: A Deep Reinforcement Learning Approach
    Bi, Xiaowen
    Shen, Minyu
    Gu, Weihua
    Chung, Edward
    Wang, Yuhong
    IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2025, 11 (02): : 7066 - 7082
  • [9] Time optimal trajectory planning of excavator based on deep reinforcement learning
    Zhang Y.-Y.
    Sun Z.-Y.
    Sun Q.-L.
    Wang Y.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (05): : 1433 - 1440
  • [10] Real-time local path planning strategy based on deep distributional reinforcement learning
    Du, Shengli
    Zhu, Zexing
    Wang, Xuefang
    Han, Honggui
    Qiao, Junfei
    NEUROCOMPUTING, 2024, 599