Composite Observer-Based Optimal Attitude-Tracking Control With Reinforcement Learning for Hypersonic Vehicles

被引:30
|
作者
Zhao, Shangwei [1 ,2 ]
Wang, Jingcheng [1 ,2 ]
Xu, Haotian [3 ]
Wang, Bohui [4 ,5 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Minist Educ China, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Key Lab Syst Control & Informat Proc, Minist Educ China, Shanghai 200240, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
[4] Xi An Jiao Tong Univ, Sch Cyber Sci & Engn, Xian 710049, Peoples R China
[5] Xidian Univ, Sch Aerosp Sci & Technol, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Hypersonic vehicles; Nonlinear dynamical systems; Optimal control; Observers; Attitude control; Aerodynamics; Vehicle dynamics; Attitude-tracking control; near-optimal control; observer design; reinforcement learning (RL); ROBUST OPTIMAL-CONTROL; NONLINEAR-SYSTEMS; EXPERIENCE REPLAY; NEURAL-NETWORK; DESIGN;
D O I
10.1109/TCYB.2022.3192871
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article proposes an observer-based reinforcement learning (RL) control approach to address the optimal attitude-tracking problem and application for hypersonic vehicles in the reentry phase. Due to the unknown uncertainty and nonlinearity caused by parameter perturbation and external disturbance, accurate model information of hypersonic vehicles in the reentry phase is generally unavailable. For this reason, a novel synchronous estimation is proposed to construct a composite observer for hypersonic vehicles, which consists of a neural-network (NN)-based Luenberger-type observer and a synchronous disturbance observer. This solves the identification problem of nonlinear dynamics in the reference control and realizes the estimation of the system state when unknown nonlinear dynamics and unknown disturbance exist at the same time. By synthesizing the information from the composite observer, an RL tracking controller is developed to solve the optimal attitude-tracking control problem. To improve the convergence performance of critic network weights, concurrent learning is employed to replace the traditional persistent excitation condition with a historical experience replay manner. In addition, this article proves that the weight estimation error is bounded when the learning rate satisfies the given sufficient condition. Finally, the numerical simulation demonstrates the effectiveness and superiority of the proposed approaches to attitude-tracking control systems for hypersonic vehicles.
引用
收藏
页码:913 / 926
页数:14
相关论文
共 50 条
  • [11] Nonlinear Attitude Control Based on Extended State Observer for Hypersonic Vehicles
    Hu, Chaofang
    Liu, Yunbing
    2014 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2014, : 2474 - 2479
  • [12] Finite-Time Extended State Observer-Based Fixed-Time Attitude Control for Hypersonic Vehicles
    Zhao, Jiaqi
    Feng, Dongzhu
    Cui, Jiashan
    Wang, Xin
    MATHEMATICS, 2022, 10 (17)
  • [13] Attitude maneuver control of flexible spacecraft by observer-based tracking control
    Hyochoong Bang
    Choong-Seok Oh
    KSME International Journal, 2004, 18 : 122 - 131
  • [14] An optimal and robust attitude-tracking control of spacecraft based on inverse system method
    School of Aerospace, Tsinghua University, Beijing 100084, China
    Gongcheng Lixue/Engineering Mechanics, 2008, 25 (02): : 214 - 218
  • [15] Attitude maneuver control of flexible spacecraft by observer-based tracking control
    Bang, H
    Oh, CS
    KSME INTERNATIONAL JOURNAL, 2004, 18 (01): : 122 - 131
  • [16] Observer-based robust integral reinforcement learning for attitude regulation of quadrotors
    Chen, Zitao
    Zhong, Weifeng
    Xie, Shengli
    Zhang, Yun
    Yuen, Chau
    KNOWLEDGE-BASED SYSTEMS, 2024, 303
  • [17] H∞ inverse optimal attitude-tracking control of rigid spacecraft
    Luo, WC
    Chu, YC
    Ling, KV
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2005, 28 (03) : 481 - 493
  • [18] Attitude control for hypersonic reentry vehicles: An efficient deep reinforcement learning method
    Liu, Yiheng
    Wang, Honglun
    Wu, Tiancai
    Lun, Yuebin
    Fan, Jiaxuan
    Wu, Jianfa
    APPLIED SOFT COMPUTING, 2022, 123
  • [19] Disturbance observer-based LI robust tracking control for hypersonic vehicles with T-S disturbance modeling
    Yi, Yang
    Xu, Lubing
    Shen, Hong
    Fan, Xiangxiang
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2016, 13 : 1 - 10
  • [20] Disturbance Observer-Based Antiwindup Control for Air-Breathing Hypersonic Vehicles
    An, Hao
    Liu, Jianxing
    Wang, Changhong
    Wu, Ligang
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2016, 63 (05) : 3038 - 3049