Learning-Based Adaptive Optimal Tracking Control of Strict-Feedback Nonlinear Systems

被引：135

作者：

Gao, Weinan ^{[1
]}

Jiang, Zhong-Ping ^{[2
]}

机构：

[1] Georgia Southern Univ, Allen E Paulson Coll Engn & Informat Technol, Dept Elect Engn, Statesboro, GA 30460 USA

[2] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, Brooklyn, NY 11201 USA

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2018年 / 29卷 / 06期

基金：

美国国家科学基金会;

关键词：

Adaptive dynamic programming (ADP); adaptive optimal tracking; optimal control; uncertain nonlinear systems; CONTINUOUS-TIME SYSTEMS; OUTPUT REGULATION;

D O I：

10.1109/TNNLS.2017.2761718

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a novel data-driven control approach to address the problem of adaptive optimal tracking for a class of nonlinear systems taking the strict-feedback form. Adaptive dynamic programming (ADP) and nonlinear output regulation theories are integrated for the first time to compute an adaptive near-optimal tracker without any a priori knowledge of the system dynamics. Fundamentally different from adaptive optimal stabilization problems, the solution to a HamiltonJacobi- Bellman (HJB) equation, not necessarily a positive definite function, cannot be approximated through the existing iterative methods. This paper proposes a novel policy iteration technique for solving positive semidefinite HJB equations with rigorous convergence analysis. A two-phase data-driven learning method is developed and implemented online by ADP. The efficacy of the proposed adaptive optimal tracking control methodology is demonstrated via a Van der Pol oscillator with time-varying exogenous signals.

引用

页码：2614 / 2624

页数：11

共 38 条

[1]

[Anonymous], EUR J CONTROL

[2]

[Anonymous], 1990, Adaptive Optimal Control the Thinking Man's GPC

[3]

[Anonymous], 1996, Neuro-dynamic programming

[4]

[Anonymous], 1995, NONLINEAR ADAPTIVE C

[5]

[Anonymous], 2007, Approximate Dynamic Programming: Solving the Curses of Dimensionality (Wiley Series in Probability and Statistics)

[6]

[Anonymous], PROGR SYSTEMS CONTRO

[7] Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design [J].

Bian, Tao ;

Jiang, Zhong-Ping .

AUTOMATICA, 2016, 71 :348-360

[8] Structurally stable output regulation of nonlinear systems [J].

Byrnes, CI ;

Priscoli, FD ;

Isidori, A ;

Kang, W .

AUTOMATICA, 1997, 33 (03) :369-385

[9] Adaptive continuous-time linear quadratic Gaussian control [J].

Duncan, TE ;

Guo, L ;

Pasik-Duncan, B .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1999, 44 (09) :1653-1662

[10] Adaptive Actor-Critic Design-Based Integral Sliding-Mode Control for Partially Unknown Nonlinear Systems With Input Disturbances [J].

Fan, Quan-Yong ;

Yang, Guang-Hong .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (01) :165-177

← 1 2 3 4 →