Incremental model based online dual heuristic programming for nonlinear adaptive control

被引:52
作者
Zhou, Ye [1 ]
van Kampen, Erik-Jan [1 ]
Chu, Qi Ping [1 ]
机构
[1] Delft Univ Technol, Kluyverweg 1, NL-2629 HS Delft, Netherlands
关键词
Reinforcement learning; Online learning; Dual heuristic programming; Adaptive control; Nonlinear control; FLIGHT CONTROL; DYNAMIC INVERSION; REINFORCEMENT; DESIGNS; CRITICS;
D O I
10.1016/j.conengprac.2017.12.011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dual heuristic programming has gained an increasing interest in recent years because it provides an effective process for optimal adaptive control of uncertain nonlinear systems. However, it requires an off-line stage to train a global system model from a representative model, which is often infeasible to obtain in practice. This paper presents a new and efficient approach for online self-learning control based on dual heuristic programming. This method uses a recursive least square method to online identify an incremental model of the system instead of a global system model. The presented incremental model based dual heuristic programming method can adaptively generate a near-optimal controller online without a priori information of the system dynamics or an off-line training stage. To compare the online adaptability of the conventional dual heuristic programming method and the newly proposed method, two numerical experiments are performed: an online reference tracking task and a fault-tolerant control task. The results reveal that the proposed method outperforms the conventional dual heuristic programming method in online learning capacity, efficiency, accuracy, and robustness. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:13 / 25
页数:13
相关论文
共 36 条
[1]  
Acquatella P. J., 2013, P EUROGNC 2013
[2]  
[Anonymous], 2002, ADAPTIVE FILTER THEO
[3]  
[Anonymous], 2020, Reinforcement Learning, An Introduction
[4]   Helicopter trimming and tracking control using direct neural dynamic programming [J].
Enns, R ;
Si, J .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (04) :929-939
[5]   Backstepping-based flight control with adaptive function approximation [J].
Farrell, J ;
Sharma, M ;
Polycarpou, M .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2005, 28 (06) :1089-1102
[6]   Online adaptive critic flight control [J].
Ferrari, S ;
Stengel, RF .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2004, 27 (05) :777-786
[7]   Continuous-time adaptive critics [J].
Hanselmann, Thomas ;
Noakes, Lyle ;
Zaknich, Anthony .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (03) :631-647
[8]   Reinforcement learning and optimal adaptive control: An overview and implementation examples [J].
Khan, Said G. ;
Herrmann, Guido ;
Lewis, Frank L. ;
Pipe, Tony ;
Melhuish, Chris .
ANNUAL REVIEWS IN CONTROL, 2012, 36 (01) :42-59
[9]   A robust adaptive nonlinear control approach to missile autopilot design [J].
Kim, SH ;
Kim, YS ;
Song, CH .
CONTROL ENGINEERING PRACTICE, 2004, 12 (02) :149-154
[10]   Design and simulation of fault tolerant flight control based on a physical approach [J].
Lombaerts, T. J. J. ;
Looye, G. H. N. ;
Chu, Q. P. ;
Mulder, J. A. .
AEROSPACE SCIENCE AND TECHNOLOGY, 2012, 23 (01) :151-171