Two-Loop Acceleration Autopilot Design and Analysis Based on TD3 Strategy

被引:0
|
作者
Fan, Junfang [1 ,2 ]
Dou, Denghui [1 ,2 ]
Ji, Yi [1 ]
Liu, Ning [2 ]
Chen, Shiwei [3 ]
Yan, Huajie [1 ,2 ]
Li, Junxian [1 ,2 ]
机构
[1] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100192, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Beijing Key Lab High Dynam Nav Technol, Beijing 100192, Peoples R China
[3] Beijing Inst Technol, Sch Aerosp Engn, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Acceleration autopilots - Autopilot designs - Control parameters - Design and analysis - Design-process - Deterministics - Fitting model - Loop acceleration - Policy gradient - Tactical missiles;
D O I
10.1155/2023/5759135
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
A two-loop acceleration autopilot is designed using the twin-delayed deep deterministic policy gradient (TD3) strategy to avoid the tedious design process of conventional tactical missile acceleration autopilots and the difficulty of meeting the performance requirements of the full flight envelope. First, a deep reinforcement learning model for the two-loop autopilot is developed. The flight state information serves as the state, the to-be-designed autopilot control parameters serve as the action, and a reward mechanism based on the stability margin index is designed. The TD3 strategy is subsequently used to offline learn the control parameters for the entire flight envelope. An autopilot control parameter fitting model that can be directly applied to the guidance loop is obtained. Finally, the obtained fitting model is combined with the impact angle constraint in the guidance system and verified online. The simulation results demonstrate that the autopilot based on the TD3 strategy can self-adjust the control parameters online based on the real-time flight state, ensuring system stability and achieving accurate acceleration command tracking.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] GRU-Attention based TD3 Network for Mobile Robot Navigation
    Jia, Jiayao
    Xing, Xiaowei
    Chang, Dong Eui
    2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022, : 1642 - 1647
  • [42] Mobile robot navigation based on intrinsic reward mechanism with TD3 algorithm
    Yang, Jianan
    Liu, Yu
    Zhang, Jie
    Guan, Yong
    Shao, Zhenzhou
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2024, 21 (05):
  • [43] Reinforcement Learning Control of Hydraulic Servo System Based on TD3 Algorithm
    Yuan, Xiaoming
    Wang, Yu
    Zhang, Ruicong
    Gao, Qiang
    Zhou, Zhuangding
    Zhou, Rulin
    Yin, Fengyuan
    MACHINES, 2022, 10 (12)
  • [44] Speed Control of IM Using RL-Based TD3 Agent
    Korpe, Ugur Ufuk
    Gokdag, Mustafa
    Gulbudak, Ozan
    PROCEEDINGS 2024 IEEE 6TH GLOBAL POWER, ENERGY AND COMMUNICATION CONFERENCE, IEEE GPECOM 2024, 2024, : 173 - 178
  • [45] Speed Optimization Control of a Permanent Magnet Synchronous Motor Based on TD3
    Hu, Zuolei
    Zhang, Yingjie
    Li, Ming
    Liao, Yuhua
    ENERGIES, 2025, 18 (04)
  • [46] Charging Station Management Strategy for Returns Maximization via Improved TD3 Deep Reinforcement Learning
    Li, Hengjie
    Zhu, Jianghao
    Zhou, Yun
    Feng, Qi
    Feng, Donghan
    INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2022, 2022
  • [47] The SU(2|3) dynamic two-loop form factors
    Brandhuber, A.
    Kostacinska, M.
    Penante, B.
    Travaglini, G.
    Young, D.
    JOURNAL OF HIGH ENERGY PHYSICS, 2016, (08):
  • [48] Charging Station Management Strategy for Returns Maximization via Improved TD3 Deep Reinforcement Learning
    Li, Hengjie
    Zhu, Jianghao
    Zhou, Yun
    Feng, Qi
    Feng, Donghan
    International Transactions on Electrical Energy Systems, 2022, 2022
  • [49] A Two-Loop Coupled Interaction System Design for Autonomous Driving Scenarios
    Cui, Mingyu
    Zhang, Yahui
    Zhang, Kejia
    HCI IN MOBILITY, TRANSPORT, AND AUTOMOTIVE SYSTEMS, MOBITAS 2024, PT I, 2024, 14732 : 101 - 115
  • [50] Path planning of mobile robot based on improved TD3 algorithm in dynamic environment
    Li, Peng
    Chen, Donghui
    Wang, Yuchen
    Zhang, Lanyong
    Zhao, Shiquan
    HELIYON, 2024, 10 (11)