Incremental Model-Based Heuristic Dynamic Programming with Output Feedback Applied to Aerospace System Identification and Control

被引:0
作者
Sun, Bo [1 ]
Van Kampen, Erik-Jan [1 ]
机构
[1] Delft Univ Technol, Dept Control & Operat, NL-2629 HS Delft, Netherlands
来源
2020 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (CCTA) | 2020年
关键词
TRACKING CONTROL; TIME-SYSTEMS;
D O I
10.1109/ccta41146.2020.9206261
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sufficient information about system dynamics and inner states is often unavailable to aerospace system controllers, which requires model-free and output feedback control techniques, respectively. This paper presents a novel self-learning control algorithm to deal with these two problems by combining the advantages of heuristic dynamic programming and incremental modeling. The system dynamics is completely unknown and only input/output data can be acquired. The controller identifies the local system models and learns control polices online both by tuning the weights of neural networks. The novel method has been applied to a multi-input multi-output nonlinear satellite attitude tracking control problem. The simulation results demonstrate that, compared with the conventional actor-critic-identifier-based heuristic dynamic programming algorithm with three networks, the proposed adaptive control algorithm improves online identification of the nonlinear system with respect to precision and speed of convergence, while maintaining similar performance compared to the full state feedback situation.
引用
收藏
页码:366 / 371
页数:6
相关论文
共 17 条
  • [1] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
    Bhasin, S.
    Kamalapurkar, R.
    Johnson, M.
    Vamvoudakis, K. G.
    Lewis, F. L.
    Dixon, W. E.
    [J]. AUTOMATICA, 2013, 49 (01) : 82 - 92
  • [2] Satellite Attitude Control System Design considering the Fuel Slosh Dynamics
    Gadelha de Souza, Luiz Carlos
    de Souza, Alain G.
    [J]. SHOCK AND VIBRATION, 2014, 2014
  • [3] Efficient Model Learning Methods for Actor-Critic Control
    Grondman, Ivo
    Vaandrager, Maarten
    Busoniu, Lucian
    Babuska, Robert
    Schuitema, Erik
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (03): : 591 - 602
  • [4] Adaptive Three-Step Kalman Filter for Air Data Sensor Fault Detection and Diagnosis
    Lu, P.
    Van Eykeren, L.
    van Kampen, E.
    de Visser, C. C.
    Chu, Q. P.
    [J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2016, 39 (03) : 590 - 604
  • [5] H∞ Tracking Control of Completely Unknown Continuous-Time Systems via Off-Policy Reinforcement Learning
    Modares, Hamidreza
    Lewis, Frank L.
    Jiang, Zhong-Ping
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (10) : 2550 - 2562
  • [6] Incremental model-based global dual heuristic programming with explicit analytical calculations applied to flight control
    Sun, Bo
    van Kampen, Erik-Jan
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 89 (89)
  • [7] Incremental Model-Based Global Dual Heuristic Programming for Flight Control
    Sun, Bo
    van Kampen, Erik-Jan
    [J]. IFAC PAPERSONLINE, 2019, 52 (29): : 7 - 12
  • [8] Van Kampen E, 2006, PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, P256
  • [9] Self-Learning Optimal Regulation for Discrete-Time Nonlinear Systems Under Event-Driven Formulation
    Wang, Ding
    Ha, Mingming
    Qiao, Junfei
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (03) : 1272 - 1279
  • [10] Adaptive Critic Nonlinear Robust Control: A Survey
    Wang, Ding
    He, Haibo
    Liu, Derong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (10) : 3429 - 3451