Composite Observer-Based Optimal Attitude-Tracking Control With Reinforcement Learning for Hypersonic Vehicles

被引:30
|
作者
Zhao, Shangwei [1 ,2 ]
Wang, Jingcheng [1 ,2 ]
Xu, Haotian [3 ]
Wang, Bohui [4 ,5 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Minist Educ China, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Key Lab Syst Control & Informat Proc, Minist Educ China, Shanghai 200240, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
[4] Xi An Jiao Tong Univ, Sch Cyber Sci & Engn, Xian 710049, Peoples R China
[5] Xidian Univ, Sch Aerosp Sci & Technol, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Hypersonic vehicles; Nonlinear dynamical systems; Optimal control; Observers; Attitude control; Aerodynamics; Vehicle dynamics; Attitude-tracking control; near-optimal control; observer design; reinforcement learning (RL); ROBUST OPTIMAL-CONTROL; NONLINEAR-SYSTEMS; EXPERIENCE REPLAY; NEURAL-NETWORK; DESIGN;
D O I
10.1109/TCYB.2022.3192871
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article proposes an observer-based reinforcement learning (RL) control approach to address the optimal attitude-tracking problem and application for hypersonic vehicles in the reentry phase. Due to the unknown uncertainty and nonlinearity caused by parameter perturbation and external disturbance, accurate model information of hypersonic vehicles in the reentry phase is generally unavailable. For this reason, a novel synchronous estimation is proposed to construct a composite observer for hypersonic vehicles, which consists of a neural-network (NN)-based Luenberger-type observer and a synchronous disturbance observer. This solves the identification problem of nonlinear dynamics in the reference control and realizes the estimation of the system state when unknown nonlinear dynamics and unknown disturbance exist at the same time. By synthesizing the information from the composite observer, an RL tracking controller is developed to solve the optimal attitude-tracking control problem. To improve the convergence performance of critic network weights, concurrent learning is employed to replace the traditional persistent excitation condition with a historical experience replay manner. In addition, this article proves that the weight estimation error is bounded when the learning rate satisfies the given sufficient condition. Finally, the numerical simulation demonstrates the effectiveness and superiority of the proposed approaches to attitude-tracking control systems for hypersonic vehicles.
引用
收藏
页码:913 / 926
页数:14
相关论文
共 50 条
  • [1] ADP-Based Attitude-Tracking Control With Prescribed Performance for Hypersonic Vehicles
    Zhao, Shangwei
    Wang, Jingcheng
    Xu, Haotian
    Wang, Bohui
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (05) : 6419 - 6431
  • [2] Disturbance Observer-Based Attitude Control for Hypersonic Vehicles with Input Constraint
    Han, Xudong
    Qiao, Jianzhong
    Du, Qingyu
    Chen, Yicheng
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8349 - 8354
  • [3] Online policy iteration ADP-based attitude-tracking control for hypersonic vehicles
    Han, Xiao
    Zheng, Zongzhun
    Liu, Lei
    Wang, Bo
    Cheng, Zhongtao
    Fan, Huijin
    Wang, Yongji
    AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 106
  • [4] Observer-based linear parameter varying H∞ tracking control for hypersonic vehicles
    Huang, Yiqing
    Shao, Shouchen
    Peng, Kai
    Jiang, Ming
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2016, 13 : 1 - 8
  • [5] Observer-Based Attitude Control for Hypersonic Gliding Vehicle
    Gao, Kehan
    Li, Qingdong
    Ren, Zhang
    Bai, Chen
    2014 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2014, : 1159 - 1162
  • [6] Neural network observer-based predefined-time attitude control for morphing hypersonic vehicles
    Lu, Xinyue
    Wang, Jianying
    Wang, Yonghai
    Chen, Jun
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 152
  • [7] Fuzzy Disturbance Observer-Based Fixed-Time Attitude Control for Hypersonic Morphing Vehicles
    Zhang, Hao
    Wang, Peng
    Tang, Guojian
    Bao, Weimin
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (05) : 6577 - 6593
  • [8] Disturbance Observer-Based Adaptive Control of Hypersonic Vehicles with Constrained Actuators
    Wang, Guan
    Li, Li
    Li, Weihua
    Zhou, Huajun
    Ma, Changbo
    Xia, Hongwei
    INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2020, 2020
  • [9] Disturbance Observer-Based Backstepping Control for Hypersonic Vehicles with Input Constraints
    Zhong, Qin
    Zhao, Jiaqi
    Liu, Ruifan
    Du, Wei
    Dai, Pei
    JOURNAL OF AEROSPACE ENGINEERING, 2024, 37 (01)
  • [10] Attitude Control of Hypersonic Vehicle based on Reinforcement Learning
    Liu, Jingwen
    Fan, Hongdong
    Fan, Yonghua
    Cai, Guangbin
    2024 3RD CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, FASTA 2024, 2024, : 1503 - 1507