Data-driven output feedback optimal control for a class of nonlinear systems via adaptive dynamic programming approach: Part I-Algorithms

被引：0

作者：

Wang, Yebin ^{[1
]}

机构：

[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA

来源：

2018 37TH CHINESE CONTROL CONFERENCE (CCC) | 2018年

关键词：

ROBOTIC MANIPULATORS;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Approximate/adaptive dynamic programming (ADP) has demonstrated great successes in the construction of data-driven output feedback optimal control for linear time-invariant systems and data-driven state feedback optimal control for nonlinear systems. This work investigates data-driven output feedback optimal control design for a class of nonlinear systems. It proposes to parameterize all admissible output feedback optimal control policies over accessible signals (system output and its time derivatives). In the case that system state can be parameterized as functions of accessible signals, then the value function and control policy can be parameterized over accessible signals, which allow ADP to be driven by accessible data. For a special case, where system state, value function and control policy can be linearly parameterized over a finite functional space over accessible signals, the policy iteration algorithm (PI) of ADP is reduced to solve a system of linear equations. Two data-driven PIs are developed to accomplish data-driven output feedback optimal control design. Simulation validates the proposed methodology.

引用

页码：2926 / 2932

页数：7

共 22 条

[1]

[Anonymous], 1995, NONLINEAR ADAPTIVE C

[2] Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation [J].