Incremental model based online heuristic dynamic programming for nonlinear adaptive tracking control with partial observability

Cited by: 29
Authors
Zhou, Ye [1, 2]
Van Kampen, Erik-Jan [2]
Chu, Qiping [2]
Affiliations
[1] Univ Sains Malaysia, Sch Aerosp Engn, Engn Campus, Nibong Tebal 14300, Pulau Pinang, Malaysia
[2] Delft Univ Technol, Fac Aerosp Engn, Kluyverweg 1, NL-2629 HS Delft, Zuid Holland, Netherlands
Keywords
Online reinforcement learning; Heuristic dynamic programming; Adaptive nonlinear flight control; Incremental techniques; Partial observability; OUTPUT-FEEDBACK CONTROL; AIRCRAFT; SPACECRAFT; DESIGN
DOI
10.1016/j.ast.2020.106013
Chinese Library Classification
V [Aeronautics, Astronautics]
Discipline classification code
08; 0825
Abstract
Heuristic dynamic programming is a class of reinforcement learning methods that has been introduced to aerospace engineering to solve nonlinear, optimal adaptive control problems. However, it requires an off-line learning stage to train a global model of the system dynamics. This paper uses an incremental model within heuristic dynamic programming to improve its online learning ability; the resulting method is called incremental model based heuristic dynamic programming. Because the incremental model is identified online, the method is a candidate for fault-tolerant control and partially observable control problems, and this study therefore also extends it to deal with partial observability. The presented method has been validated on two online tracking problems: missile fault-tolerant control with full-state measurements, and spacecraft attitude control disturbed by liquid sloshing under partially observable conditions. The results show that the proposed method outperforms the conventional heuristic dynamic programming method in fault-tolerant control tasks, handles partial observability, and is robust to internal uncertainties and external disturbances. © 2020 Elsevier Masson SAS. All rights reserved.
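The abstract does not say how the incremental model is identified. As a rough illustration only, the sketch below assumes a local linear model on increments, dx[t+1] ≈ F dx[t] + G du[t], fitted with recursive least squares. The function name `rls_incremental_model`, the parameters `forgetting` and `p0`, and the choice of recursive least squares are all assumptions made for this example, not details taken from the record.

```python
import numpy as np

def rls_incremental_model(dx, du, forgetting=1.0, p0=1e4):
    """Fit dx[t+1] ~ F @ dx[t] + G @ du[t] by recursive least squares.

    dx: (T, n) array of state increments x[t+1] - x[t]
    du: (T-1, m) or longer array of input increments u[t+1] - u[t]
    All names and defaults here are hypothetical, chosen for illustration.
    """
    n, m = dx.shape[1], du.shape[1]
    theta = np.zeros((n + m, n))      # stacked [F G] transposed
    P = p0 * np.eye(n + m)            # large initial covariance
    for t in range(len(dx) - 1):
        phi = np.concatenate([dx[t], du[t]])        # regressor
        err = dx[t + 1] - phi @ theta               # one-step prediction error
        gain = P @ phi / (forgetting + phi @ P @ phi)
        theta += np.outer(gain, err)                # parameter update
        P = (P - np.outer(gain, phi @ P)) / forgetting
    return theta[:n].T, theta[n:].T                 # F (n x n), G (n x m)
```

With `forgetting` below 1, older samples are discounted, which is one way such an estimator could track dynamics that change after a fault; whether the paper does this is not stated here.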
Pages: 14
Related papers
50 in total
  • [1] Incremental Approximate Dynamic Programming for Nonlinear Adaptive Tracking Control with Partial Observability
    Zhou, Ye
    van Kampen, Erik-Jan
    Chu, QiPing
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2018, 41 (12) : 2554 - 2567
  • [2] Intelligent adaptive optimal control using incremental model-based global dual heuristic programming subject to partial observability
    Sun, Bo
    van Kampen, Erik-Jan
    APPLIED SOFT COMPUTING, 2021, 103
  • [3] Nonlinear Adaptive Flight Control Using Incremental Approximate Dynamic Programming and Output Feedback
    Zhou, Ye
    van Kampen, Erik-Jan
    Chu, QiPing
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2017, 40 (02) : 493 - 500
  • [4] Adaptive Online Data-Driven Tracking Control for Highly Flexible Aircrafts With Partial Observability
    Peng, Chi
    Ma, Jianjun
    IEEE ACCESS, 2020, 8 : 192844 - 192856
  • [5] Adaptive Event-Triggered Control Based on Heuristic Dynamic Programming for Nonlinear Discrete-Time Systems
    Dong, Lu
    Zhong, Xiangnan
    Sun, Changyin
    He, Haibo
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (07) : 1594 - 1605
  • [6] Nonlinear trajectory-tracking control for autonomous underwater vehicle based on iterative adaptive dynamic programming
    Che, Gaofeng
    Liu, Lijun
    Yu, Zhen
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (03) : 4205 - 4215
  • [7] Robust Optimal Parallel Tracking Control Based on Adaptive Dynamic Programming
    Wei, Qinglai
    Jiao, Shanshan
    Wang, Fei-Yue
    Dong, Qi
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (07) : 4308 - 4321
  • [8] Robust tracking control of nonlinear unmatched uncertain systems via event-based adaptive dynamic programming
    Dahal, Raju
    Kar, Indrani
    NONLINEAR DYNAMICS, 2022, 109 (04) : 2831 - 2850
  • [9] Tracking Control for a Quadrotor via Dynamic Surface Control and Adaptive Dynamic Programming
    Gao, Qiang
    Wei, Xin-Tong
    Li, Da-Hua
    Ji, Yue-Hui
    Jia, Chao
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2022, 20 (01) : 349 - 363
  • [10] Adaptive dynamic programming based composite control for profile tracking with multiple constraints
    Tang, Rui
    Luo, Biao
    Liao, Yuxin
    NEUROCOMPUTING, 2023, 557