Incremental model based online heuristic dynamic programming for nonlinear adaptive tracking control with partial observability

Cited by: 29
Authors
Zhou, Ye [1, 2]
Van Kampen, Erik-Jan [2 ]
Chu, Qiping [2 ]
Affiliations
[1] Univ Sains Malaysia, Sch Aerosp Engn, Engn Campus, Nibong Tebal 14300, Pulau Pinang, Malaysia
[2] Delft Univ Technol, Fac Aerosp Engn, Kluyverweg 1, NL-2629 HS Delft, Zuid Holland, Netherlands
Keywords
Online reinforcement learning; Heuristic dynamic programming; Adaptive nonlinear flight control; Incremental techniques; Partial observability; Output-feedback control; Aircraft; Spacecraft; Design
DOI
10.1016/j.ast.2020.106013
Chinese Library Classification (CLC) number
V [Aeronautics, Astronautics]
Discipline classification code
08; 0825
Abstract
Heuristic dynamic programming is a class of reinforcement learning methods that has been introduced to aerospace engineering to solve nonlinear, optimal adaptive control problems. However, it requires an off-line learning stage to train a global system model that represents the system dynamics. This paper uses an incremental model in heuristic dynamic programming to improve the online learning ability; the resulting method is called incremental model based heuristic dynamic programming. Because the incremental model is identified online, the method is an option for fault-tolerant control and partially observable control problems, and this study therefore also extends it to deal with partial observability. The method has been validated on two online tracking problems: missile fault-tolerant control with full-state measurements, and spacecraft attitude control disturbed by liquid sloshing under partially observable conditions. The results show that the proposed method outperforms the conventional heuristic dynamic programming method in fault-tolerant control tasks, deals with partial observability, and is robust to internal uncertainties and external disturbances. (C) 2020 Elsevier Masson SAS. All rights reserved.
Pages: 14
Related papers
50 items in total
  • [21] Guaranteed cost neural tracking control for a class of uncertain nonlinear systems using adaptive dynamic programming
    Yang, Xiong
    Liu, Derong
    Wei, Qinglai
    Wang, Ding
    NEUROCOMPUTING, 2016, 198 : 80 - 90
  • [22] Online optimal control for dynamic positioning of vessels via time-based adaptive dynamic programming
    Gao, Xiaoyang
    Li, Tieshan
    Shan, Qihe
    Xiao, Yang
    Yuan, Liang'en
    Liu, Yifan
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019, 14 (12) : 15629 - 15641
  • [23] Observer-Based Adaptive Fuzzy Control for Switched Stochastic Nonlinear Systems With Partial Tracking Errors Constrained
    Sui, Shuai
    Li, Yongming
    Tong, Shaocheng
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2016, 46 (12) : 1605 - 1617
  • [24] Dynamic model based nonlinear tracking control of a planar parallel manipulator
    Shang, Wei-wei
    Cong, Shuang
    Jiang, Shi-long
    NONLINEAR DYNAMICS, 2010, 60 (04) : 597 - 606
  • [25] Incremental Dual Heuristic Dynamic Programming Based Hybrid Approach for Multi-Channel Control of Unstable Tailless Aircraft
    Li, Hangxu
    Sun, Liguo
    Tan, Wenqian
    Liu, Xiaoyu
    Dang, Weigao
    IEEE ACCESS, 2022, 10 : 31677 - 31691
  • [26] Adaptive output feedback tracking control of stochastic nonlinear systems with dynamic uncertainties
    Zhang, Tianping
    Xia, Xiaonan
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2015, 25 (09) : 1282 - 1300
  • [27] Observer based adaptive dynamic programming for fault tolerant control of a class of nonlinear systems
    Zhao, Bo
    Liu, Derong
    Li, Yuanchun
    INFORMATION SCIENCES, 2017, 384 : 21 - 33
  • [28] Zero-sum game optimal control for the nonlinear switched systems based on heuristic dynamic programming
    Fu, Xingjian
    Li, Zizheng
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2023, 44 (05) : 2821 - 2837
  • [29] Adaptive tracking control of nonlinear systems with dynamic uncertainties using neural network
    Han, Yu-Qun
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2018, 49 (07) : 1391 - 1402
  • [30] Attitude Tracking Control for a Quadrotor via backstepping and Adaptive Dynamic Programming
    Chi, Wenhao
    Ji, Yuehui
    Gao, Qiang
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019 : 3108 - 3113