Optimal Control of Affine Nonlinear Continuous-time Systems Using an Online Hamilton-Jacobi-Isaacs Formulation

被引:55
作者
Dierks, T. [1 ]
Jagannathan, S. [2 ]
机构
[1] DRS Sustainment Syst Inc, 201 Evans Lane, St Louis, MO 63121 USA
[2] Univ Missouri Sci & Technol, Dept Elect & Comp Engn, St Rolla, MO 65211 USA
来源
49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC) | 2010年
基金
美国国家科学基金会;
关键词
TRACKING;
D O I
10.1109/CDC.2010.5717676
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Solving the Hamilton-Jacobi-Isaacs (HJI) equation, commonly used in H-infinity optimal control, is often referred to as a two-player differential game where one player tries to minimize the cost function while the other tries to maximize it. In this paper, the HJI equation is formulated online and forward-in-time using a novel single online approximator (SOLA)-based scheme to achieve optimal regulation and tracking control of affine nonlinear continuous-time systems. The SOLA-based adaptive approach is designed to learn the infinite horizon HJI equation, the corresponding optimal control input, and the worst case disturbance. A novel parameter tuning algorithm is derived which not only achieves the optimal cost function, control input, and the disturbance, but also ensures the system states remain bounded during the online learning. Lyapunov methods are used to show that all signals are uniformly ultimately bounded (UUB) while ensuring the approximated signals approach their optimal values with small bounded error. In the absence of OLA reconstruction errors, asymptotic convergence to the optimal signals is demonstrated, and simulation results illustrate the effectiveness of the approach.
引用
收藏
页码:3048 / 3053
页数:6
相关论文
共 16 条
[1]   Neurodynamic programming and zero-sum games for constrained control systems [J].
Abu-Khalaf, Murad ;
Lewis, Frank L. ;
Huang, Jie .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (07) :1243-1252
[2]   Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J].
Al-Tamimi, Asma ;
Lewis, Frank L. ;
Abu-Khalaf, Murad .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :943-949
[3]  
[Anonymous], 1999, Neural network control of robot manipulators and nonlinear systems
[4]   Successive Galerkin approximation algorithms for nonlinear optimal and robust control [J].
Beard, RW ;
McLain, TW .
INTERNATIONAL JOURNAL OF CONTROL, 1998, 71 (05) :717-743
[5]  
Dierks T, 2010, P AMER CONTR CONF, P1568
[6]   Optimal Control of Affine Nonlinear Discrete-time Systems [J].
Dierks, Travis ;
Jagannthan, S. .
MED: 2009 17TH MEDITERRANEAN CONFERENCE ON CONTROL & AUTOMATION, VOLS 1-3, 2009, :1390-1395
[7]  
Doyle J, 1996, IEEE DECIS CONTR P, P3915, DOI 10.1109/CDC.1996.577292
[8]   An algorithm to solve the discrete HJI equation arising in the L2 gain optimization problem [J].
Huang, J .
INTERNATIONAL JOURNAL OF CONTROL, 1999, 72 (01) :49-57
[9]  
Khalil H., 2002, Control of Nonlinear Systems
[10]  
KNOBLOCK HW, 1993, TOPICS CONTROL THEOR