Incremental model based online heuristic dynamic programming for nonlinear adaptive tracking control with partial observability

Cited by: 29
Authors
Zhou, Ye [1 ,2 ]
Van Kampen, Erik-Jan [2 ]
Chu, Qiping [2 ]
Affiliations
[1] Univ Sains Malaysia, Sch Aerosp Engn, Engn Campus, Nibong Tebal 14300, Pulau Pinang, Malaysia
[2] Delft Univ Technol, Fac Aerosp Engn, Kluyverweg 1, NL-2629 HS Delft, Zuid Holland, Netherlands
Keywords
Online reinforcement learning; Heuristic dynamic programming; Adaptive nonlinear flight control; Incremental techniques; Partial observability; OUTPUT-FEEDBACK CONTROL; AIRCRAFT; SPACECRAFT; DESIGN;
DOI
10.1016/j.ast.2020.106013
Chinese Library Classification
V [Aeronautics, Astronautics];
Discipline classification code
08 ; 0825 ;
Abstract
Heuristic dynamic programming is a class of reinforcement learning methods that has been introduced to aerospace engineering to solve nonlinear, optimal adaptive control problems. However, it requires an off-line learning stage to train a global model of the system dynamics. This paper uses an incremental model in heuristic dynamic programming to improve online learning ability; the resulting method is called incremental model based heuristic dynamic programming. Because the incremental model is identified online, the method is a candidate for fault-tolerant control and partially observable control problems. This study therefore also extends the method to handle partial observability. The method is validated on two online tracking problems: missile fault-tolerant control with full-state measurements, and spacecraft attitude control disturbed by liquid sloshing under partially observable conditions. The results show that the proposed method outperforms conventional heuristic dynamic programming in fault-tolerant control tasks, handles partial observability, and is robust to internal uncertainties and external disturbances. © 2020 Elsevier Masson SAS. All rights reserved.
Pages: 14
Related papers
50 in total
  • [31] Fault tolerant tracking control for nonlinear systems with actuator failures through particle swarm optimization-based adaptive dynamic programming
    Liu, Xi
    Zhao, Bo
    Liu, Derong
    APPLIED SOFT COMPUTING, 2020, 97
  • [32] Adaptive Neural Controller Design for Synchronous Generator Based on Heuristic Dynamic Programming
    Song Shao-jian
    Li Xiao-qiang
    Lin Xiao-feng
    Liao Bi-lin
    CCDC 2009: 21ST CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-6, PROCEEDINGS, 2009, : 2161 - 2166
  • [33] Ecological Adaptive Cruise Control and Energy Management Strategy for Hybrid Electric Vehicles Based on Heuristic Dynamic Programming
    Li, Guoqiang
    Goerges, Daniel
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (09) : 3526 - 3535
  • [34] Pattern-Moving-Based Partial Form Dynamic Linearization Model Free Adaptive Control for a Class of Nonlinear Systems
    Li, Xiangquan
    Xu, Zhengguang
    ACTUATORS, 2021, 10 (09)
  • [35] Predictive Event-Triggered Control based on Heuristic Dynamic Programming for Nonlinear Continuous-Time Systems
    Dong, Lu
    Zhong, Xiangnan
    Sun, Changyin
    He, Haibo
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [36] Nonlinear and Adaptive Suboptimal Control of Connected Vehicles: A Global Adaptive Dynamic Programming Approach
    Gao, Weinan
    Jiang, Zhong-Ping
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2017, 85 (3-4) : 597 - 611
  • [37] An Improved Reinforcement Learning Based Heuristic Dynamic Programming Algorithm for Model-Free Optimal Control
    Li, Jia
    Yuan, Zhaolin
    Ban, Xiaojuan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 282 - 294
  • [38] Optimal control for nonlinear continuous systems by adaptive dynamic programming based on fuzzy basis functions
    Zhang, Jilie
    Liang, Hongjing
    Feng, Tao
    APPLIED MATHEMATICAL MODELLING, 2016, 40 (13-14) : 6766 - 6774
  • [39] Event-triggered adaptive dynamic programming for decentralized tracking control of input constrained unknown nonlinear interconnected systems
    Wu, Qiuye
    Zhao, Bo
    Liu, Derong
    Polycarpou, Marios M.
    NEURAL NETWORKS, 2023, 157 : 336 - 349
  • [40] Trajectory tracking control for underactuated autonomous vehicles via adaptive dynamic programming
    Han, Xiumei
    Zhao, Xudong
    Xu, Xiaolu
    Mei, Congli
    Xing, Wei
    Wang, Xinwei
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2024, 361 (01) : 474 - 488