Kernel-based Adaptive Critic Designs for Optimal Control of Nonlinear Discrete-time System

被引：0

作者：

Tan, Fuxiao ^{[1
]}

Guan, Xinping ^{[2
]}

机构：

[1] Fuyang Normal Univ, Sch Comp & Informat Engn, Fuyang 236037, Peoples R China

[2] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China

来源：

2018 37TH CHINESE CONTROL CONFERENCE (CCC) | 2018年

关键词：

Adaptive Dynamic Programming (ADP); Adaptive Critic Designs (ACDs); Online Kernel Learning; Nonlinear Discrete-time System; Optimal Control; ONLINE LEARNING CONTROL; POLICY ITERATION;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Because it is necessary to solve the nonlinear Hamilton-Jacobi-Bellman (HJB) partial differential equations or the nonlinear two-point boundary value problems, the optimal control of nonlinear system is quite difficultly. Based on adaptive dynamic programming ( ADP) and adaptive critic designs (ACDs), the optimal control of nonlinear system theory has been intensive studied. However, based on neural network, the ADP still have the problem of network structure design and learning parameters optimization. Because the kernel function can reflect the similarity among data by the form of inner product of data, the traditional linear learning algorithm can be extended to nonlinear theoretical framework by kernel function. Based on online kernel learning, the kernel-based adaptive critic designs is studied in this paper. Then, the critic of ADP is designed by using the kernel recursive least square algorithm. Thus, the realization process of the KHDP and KDHP algorithms are obtained. Finally, example and computer simulation are shown that the kernel-based ACDs can effectively solve the optimal control of nonlinear discrete-time system.

引用

页码：2167 / 2172

页数：6

共 32 条

[1]

[Anonymous], IEEE T NEURAL NETW L

[2]

[Anonymous], 2012, REINFORCEMENT LEARNI

[3]

[Anonymous], 2002, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond

[4]

[Anonymous], 1996, Neuro-dynamic programming

[5]

[Anonymous], 2003, J. Mach. Learn. Res.

[6] Online Selective Kernel-Based Temporal Difference Learning [J].

Chen, Xingguo ;

Gao, Yang ;

Wang, Ruili .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (12) :1944-1956

[7]

Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482

[8] The kernel recursive least-squares algorithm [J].

Engel, Y ;

Mannor, S ;

Meir, R .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (08) :2275-2285

[9] Analyzing Sparse Dictionaries for Online Learning With Kernels [J].

Honeine, Paul .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2015, 63 (23) :6343-6353

[10]

Ingo Steinwart e Andreas Christmann., 2008, Support Vector Machines [WWW Document], V1st

← 1 2 3 4 →