Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems

被引：191

作者：

Jiang, Yu ^{[1
]}

Jiang, Zhong-Ping ^{[2
]}

机构：

[1] The MathWorks, Natick, MA 01760 USA

[2] NYU, Polytech Sch Engn, Dept Elect & Comp Engn, Brooklyn, NY 11201 USA

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2015年 / 60卷 / 11期

基金：

美国国家科学基金会;

关键词：

Adaptive dynamic programming; global stabilization; nonlinear systems; optimal control; STABILIZATION; APPROXIMATIONS; OBSERVABILITY; CONTROLLERS; DESIGN;

D O I：

10.1109/TAC.2015.2414811

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a novel method of global adaptive dynamic programming (ADP) for the adaptive optimal control of nonlinear polynomial systems. The strategy consists of relaxing the problem of solving the Hamilton-Jacobi-Bellman (HJB) equation to an optimization problem, which is solved via a new policy iteration method. The proposed method distinguishes from previously known nonlinear ADP methods in that the neural network approximation is avoided, giving rise to significant computational improvement. Instead of semiglobally or locally stabilizing, the resultant control policy is globally stabilizing for a general class of nonlinear polynomial systems. Furthermore, in the absence of the a priori knowledge of the system dynamics, an online learning method is devised to implement the proposed policy iteration technique by generalizing the current ADP theory. Finally, three numerical examples are provided to validate the effectiveness of the proposed method.

引用

页码：2917 / 2929

页数：13

共 61 条

[1] Forward completeness, unboundedness observability, and their Lyapunov characterizations [J].