Incremental model based online heuristic dynamic programming for nonlinear adaptive tracking control with partial observability

被引:29
|
作者
Zhou, Ye [1 ,2 ]
Van Kampen, Erik-Jan [2 ]
Chu, Qiping [2 ]
机构
[1] Univ Sains Malaysia, Sch Aerosp Engn, Engn Campus, Nibong Tebal 14300, Pulau Pinang, Malaysia
[2] Delft Univ Technol, Fac Aerosp Engn, Kluyverweg 1, NL-2629 HS Delft, Zuid Holland, Netherlands
关键词
Online reinforcement learning; Heuristic dynamic programming; Adaptive nonlinear flight control; Incremental techniques; Partial observability; OUTPUT-FEEDBACK CONTROL; AIRCRAFT; SPACECRAFT; DESIGN;
D O I
10.1016/j.ast.2020.106013
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Heuristic dynamic programming is a class of reinforcement learning, which has been introduced to aerospace engineering to solve nonlinear, optimal adaptive control problems. However, it requires an off-line learning stage to train a global system model to represent the system dynamics. This paper uses an incremental model in heuristic dynamic programming to improve the online learning ability, which is incremental model based heuristic dynamic programming. The trait of the online identification of the incremental model makes this method an option for fault-tolerant control and partially observable control problems. This study, therefore, also extends this method to deal with partial observability. The presented method has been validated on two different online tracking problems: missile fault-tolerant control with full-state measurements and also spacecraft attitude control disturbed with liquid sloshing under partially observable conditions. The results reveal that the proposed method outperforms the conventional heuristic dynamic programming method in fault-tolerant control tasks, deals with partial observability, and is robust to internal uncertainties and external disturbances. (C) 2020 Elsevier Masson SAS. All rights reserved.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Adaptive Neural Tracking Control for Nonlinear Switched Systems with Dynamic Uncertainties
    Zhou, Wanlu
    Li, Huan
    Niu, Ben
    PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 3932 - 3937
  • [42] Model-Free Composite Control of Flexible Manipulators Based on Adaptive Dynamic Programming
    Yang, Chunyu
    Xu, Yiming
    Zhou, Linna
    Sun, Yongzheng
    COMPLEXITY, 2018,
  • [43] Robust tracking control of quadrotor via on-policy adaptive dynamic programming
    Dou, Liqian
    Su, Xiaotong
    Zhao, Xinyi
    Zong, Qun
    He, Lei
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (07) : 2509 - 2525
  • [44] Modeling and Trajectory Tracking Control for Magnetic Wheeled Mobile Robots Based on Improved Dual-Heuristic Dynamic Programming
    Dian, Songyi
    Fang, Hongwei
    Zhao, Tao
    Wu, Qing
    Hu, Yi
    Guo, Rui
    Li, Shengchuan
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (02) : 1470 - 1482
  • [45] Robust Attitude Tracking Control Based on Adaptive Dynamic Programming for Flexible Dumbbell-Shaped Spacecraft
    Huang, Wenke
    Ran, Guangtao
    Wang, Bohui
    Li, Dongyu
    Dong, Wenye
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (02) : 2394 - 2406
  • [46] Adaptive NN Tracking Control for Pure-Feedback Stochastic Nonlinear Systems Based on Dynamic Surface Control
    Cui Guozeng
    Zhang Baoyong
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 8735 - 8740
  • [47] A novel adaptive heuristic dynamic programming-based algorithm for aircraft confrontation games
    Mao, Yi
    Chen, Zhijie
    Yang, Yi
    Hu, Yuxin
    FUNDAMENTAL RESEARCH, 2021, 1 (06): : 792 - 799
  • [48] RLS Algorithms and Convergence Analysis Method for Online DLQR Control Design via Heuristic Dynamic Programming
    Santos, Watson R. M.
    Queiroz, Jonathan A.
    Neto, Joao Viana da F.
    Rego, Patricia H. M.
    Santana, Ewaldo
    Andrade, Gustavo
    2014 UKSIM-AMSS 16TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM), 2014, : 77 - 83
  • [49] Adaptive tracking control of uncertain MIMO nonlinear systems based on generalized fuzzy hyperbolic model
    Cui, Yang
    Zhang, Huaguang
    Wang, Yingchun
    FUZZY SETS AND SYSTEMS, 2017, 306 : 105 - 117
  • [50] Multi-loop adaptive internal model control based on a dynamic partial least squares model
    Zhao, Zhao
    Hu, Bin
    Liang, Jun
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A, 2011, 12 (03): : 190 - 200