Optimal Control of Nonlinear Continuous-Time Systems in Strict-Feedback Form

被引：139

作者：

Zargarzadeh, Hassan ^{[1
]}

Dierks, Travis ^{[2
]}

Jagannathan, Sarangapani ^{[3
]}

机构：

[1] Lamar Univ, Dept Elect Engn, Beaumont, TX 77710 USA

[2] DRS Sustainment Syst Inc, St Louis, MO 63121 USA

[3] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2015年 / 26卷 / 10期

基金：

美国国家科学基金会;

关键词：

Adaptive backstepping; adaptive control; neural network (NN)-based dynamic programming; nonlinear strict-feedback systems; optimal control; DYNAMICS; TRACKING; DESIGN;

D O I：

10.1109/TNNLS.2015.2441712

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a novel optimal tracking control scheme for nonlinear continuous-time systems in strict-feedback form with uncertain dynamics. The optimal tracking problem is transformed into an equivalent optimal regulation problem through a feedforward adaptive control input that is generated by modifying the standard backstepping technique. Subsequently, a neural network-based optimal control scheme is introduced to estimate the cost, or value function, over an infinite horizon for the resulting nonlinear continuous-time systems in affine form when the internal dynamics are unknown. The estimated cost function is then used to obtain the optimal feedback control input; therefore, the overall optimal control input for the nonlinear continuous-time system in strict-feedback form includes the feedforward plus the optimal feedback terms. It is shown that the estimated cost function minimizes the Hamilton-Jacobi-Bellman estimation error in a forward-in-time manner without using any value or policy iterations. Finally, optimal output feedback control is introduced through the design of a suitable observer. Lyapunov theory is utilized to show the overall stability of the proposed schemes without requiring an initial admissible controller. Simulation examples are provided to validate the theoretical results.

引用

页码：2535 / 2549

页数：15

共 27 条

[1] Successive Galerkin approximation algorithms for nonlinear optimal and robust control [J].

Beard, RW ;

McLain, TW .

INTERNATIONAL JOURNAL OF CONTROL, 1998, 71 (05) :717-743

[2] Adaptive control with guaranteed transient and steady state tracking error bounds for strict feedback systems [J].

Bechlioulis, Charalampos P. ;

Rovithakis, George A. .

AUTOMATICA, 2009, 45 (02) :532-538

[3] Online optimal control of nonlinear discrete-time systems using approximate dynamic programming [J].

Dierks T. ;

Jagannathan S. .

Journal of Control Theory and Applications, 2011, 9 (3) :361-369

[4]

Dierks T, 2010, P AMER CONTR CONF, P1568

[5] Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update [J].

Dierks, Travis ;

Jagannathan, Sarangapani .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (07) :1118-1129

[6] Optimal Control of Affine Nonlinear Discrete-time Systems [J].

Dierks, Travis ;

Jagannthan, S. .

MED: 2009 17TH MEDITERRANEAN CONFERENCE ON CONTROL & AUTOMATION, VOLS 1-3, 2009, :1390-1395

[7] Neural-Network-Based State Feedback Control of a Nonlinear Discrete-Time System in Nonstrict Feedback Form [J].

Jagannathan, Sarangapani ;

He, Pingan .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (12) :2073-2087

[8]

Khalil H.K., 2002, Nonlinear systems, V3rd

[9]

Krstic M., 1995, Nonlinear and Adaptive Control Design

[10]

Lewis F. L., 1999, NEURAL NETWORK CONTR

← 1 2 3 →