Incremental Model-Based Heuristic Dynamic Programming with Output Feedback Applied to Aerospace System Identification and Control

被引：0

作者：

Sun, Bo ^{[1
]}

Van Kampen, Erik-Jan ^{[1
]}

机构：

[1] Delft Univ Technol, Dept Control & Operat, NL-2629 HS Delft, Netherlands

来源：

2020 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (CCTA) | 2020年

关键词：

TRACKING CONTROL; TIME-SYSTEMS;

D O I：

10.1109/ccta41146.2020.9206261

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Sufficient information about system dynamics and inner states is often unavailable to aerospace system controllers, which requires model-free and output feedback control techniques, respectively. This paper presents a novel self-learning control algorithm to deal with these two problems by combining the advantages of heuristic dynamic programming and incremental modeling. The system dynamics is completely unknown and only input/output data can be acquired. The controller identifies the local system models and learns control polices online both by tuning the weights of neural networks. The novel method has been applied to a multi-input multi-output nonlinear satellite attitude tracking control problem. The simulation results demonstrate that, compared with the conventional actor-critic-identifier-based heuristic dynamic programming algorithm with three networks, the proposed adaptive control algorithm improves online identification of the nonlinear system with respect to precision and speed of convergence, while maintaining similar performance compared to the full state feedback situation.

引用

页码：366 / 371

页数：6

共 17 条

[1] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
Bhasin, S.
Kamalapurkar, R.
Johnson, M.
Vamvoudakis, K. G.
Lewis, F. L.
Dixon, W. E.
[J]. AUTOMATICA, 2013, 49 (01) : 82 - 92
[2] Satellite Attitude Control System Design considering the Fuel Slosh Dynamics
Gadelha de Souza, Luiz Carlos
de Souza, Alain G.
[J]. SHOCK AND VIBRATION, 2014, 2014
[3] Efficient Model Learning Methods for Actor-Critic Control
Grondman, Ivo
Vaandrager, Maarten
Busoniu, Lucian
Babuska, Robert
Schuitema, Erik
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (03): : 591 - 602
[4] Adaptive Three-Step Kalman Filter for Air Data Sensor Fault Detection and Diagnosis
Lu, P.
Van Eykeren, L.
van Kampen, E.
de Visser, C. C.
Chu, Q. P.
[J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2016, 39 (03) : 590 - 604
[5] H∞ Tracking Control of Completely Unknown Continuous-Time Systems via Off-Policy Reinforcement Learning
Modares, Hamidreza
Lewis, Frank L.
Jiang, Zhong-Ping
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (10) : 2550 - 2562
[6] Incremental model-based global dual heuristic programming with explicit analytical calculations applied to flight control
Sun, Bo
van Kampen, Erik-Jan
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 89 (89)
[7] Incremental Model-Based Global Dual Heuristic Programming for Flight Control
Sun, Bo
van Kampen, Erik-Jan
[J]. IFAC PAPERSONLINE, 2019, 52 (29): : 7 - 12
[8] Van Kampen E, 2006, PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, P256
[9] Self-Learning Optimal Regulation for Discrete-Time Nonlinear Systems Under Event-Driven Formulation
Wang, Ding
Ha, Mingming
Qiao, Junfei
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (03) : 1272 - 1279
[10] Adaptive Critic Nonlinear Robust Control: A Survey
Wang, Ding
He, Haibo
Liu, Derong
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (10) : 3429 - 3451

← 1 2 →