Incremental model based online heuristic dynamic programming for nonlinear adaptive tracking control with partial observability

被引：29

作者：

Zhou, Ye ^{[1
,2
]}

Van Kampen, Erik-Jan ^{[2
]}

Chu, Qiping ^{[2
]}

机构：

[1] Univ Sains Malaysia, Sch Aerosp Engn, Engn Campus, Nibong Tebal 14300, Pulau Pinang, Malaysia

[2] Delft Univ Technol, Fac Aerosp Engn, Kluyverweg 1, NL-2629 HS Delft, Zuid Holland, Netherlands

来源：

AEROSPACE SCIENCE AND TECHNOLOGY | 2020年 / 105卷

关键词：

Online reinforcement learning; Heuristic dynamic programming; Adaptive nonlinear flight control; Incremental techniques; Partial observability; OUTPUT-FEEDBACK CONTROL; AIRCRAFT; SPACECRAFT; DESIGN;

D O I：

10.1016/j.ast.2020.106013

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Heuristic dynamic programming is a class of reinforcement learning, which has been introduced to aerospace engineering to solve nonlinear, optimal adaptive control problems. However, it requires an off-line learning stage to train a global system model to represent the system dynamics. This paper uses an incremental model in heuristic dynamic programming to improve the online learning ability, which is incremental model based heuristic dynamic programming. The trait of the online identification of the incremental model makes this method an option for fault-tolerant control and partially observable control problems. This study, therefore, also extends this method to deal with partial observability. The presented method has been validated on two different online tracking problems: missile fault-tolerant control with full-state measurements and also spacecraft attitude control disturbed with liquid sloshing under partially observable conditions. The results reveal that the proposed method outperforms the conventional heuristic dynamic programming method in fault-tolerant control tasks, deals with partial observability, and is robust to internal uncertainties and external disturbances. (C) 2020 Elsevier Masson SAS. All rights reserved.

引用

页数：14

共 50 条

[41] Adaptive Neural Tracking Control for Nonlinear Switched Systems with Dynamic Uncertainties
Zhou, Wanlu
Li, Huan
Niu, Ben
PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 3932 - 3937
[42] Model-Free Composite Control of Flexible Manipulators Based on Adaptive Dynamic Programming
Yang, Chunyu
Xu, Yiming
Zhou, Linna
Sun, Yongzheng
COMPLEXITY, 2018,
[43] Robust tracking control of quadrotor via on-policy adaptive dynamic programming
Dou, Liqian
Su, Xiaotong
Zhao, Xinyi
Zong, Qun
He, Lei
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (07) : 2509 - 2525
[44] Modeling and Trajectory Tracking Control for Magnetic Wheeled Mobile Robots Based on Improved Dual-Heuristic Dynamic Programming
Dian, Songyi
Fang, Hongwei
Zhao, Tao
Wu, Qing
Hu, Yi
Guo, Rui
Li, Shengchuan
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (02) : 1470 - 1482
[45] Robust Attitude Tracking Control Based on Adaptive Dynamic Programming for Flexible Dumbbell-Shaped Spacecraft
Huang, Wenke
Ran, Guangtao
Wang, Bohui
Li, Dongyu
Dong, Wenye
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (02) : 2394 - 2406
[46] Adaptive NN Tracking Control for Pure-Feedback Stochastic Nonlinear Systems Based on Dynamic Surface Control
Cui Guozeng
Zhang Baoyong
2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 8735 - 8740
[47] A novel adaptive heuristic dynamic programming-based algorithm for aircraft confrontation games
Mao, Yi
Chen, Zhijie
Yang, Yi
Hu, Yuxin
FUNDAMENTAL RESEARCH, 2021, 1 (06): : 792 - 799
[48] RLS Algorithms and Convergence Analysis Method for Online DLQR Control Design via Heuristic Dynamic Programming
Santos, Watson R. M.
Queiroz, Jonathan A.
Neto, Joao Viana da F.
Rego, Patricia H. M.
Santana, Ewaldo
Andrade, Gustavo
2014 UKSIM-AMSS 16TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM), 2014, : 77 - 83
[49] Adaptive tracking control of uncertain MIMO nonlinear systems based on generalized fuzzy hyperbolic model
Cui, Yang
Zhang, Huaguang
Wang, Yingchun
FUZZY SETS AND SYSTEMS, 2017, 306 : 105 - 117
[50] Multi-loop adaptive internal model control based on a dynamic partial least squares model
Zhao, Zhao
Hu, Bin
Liang, Jun
JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A, 2011, 12 (03): : 190 - 200

← 1 2 3 4 5 →