Neuro-optimal tracking control for a class of discrete-time nonlinear systems via generalized value iteration adaptive dynamic programming approach

被引：0

作者：

Qinglai Wei

Derong Liu

Yancai Xu

机构：

[1] Chinese Academy of Sciences,The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation

来源：

Soft Computing | 2016年 / 20卷

关键词：

Adaptive dynamic programming; Approximate dynamic programming; Adaptive critic designs; Optimal control; Neural networks; Nonlinear systems; Reinforcement learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this paper, a novel value iteration adaptive dynamic programming (ADP) algorithm, called “generalized value iteration ADP” algorithm, is developed to solve infinite horizon optimal tracking control problems for a class of discrete-time nonlinear systems. The developed generalized value iteration ADP algorithm permits an arbitrary positive semi-definite function to initialize it, which overcomes the disadvantage of traditional value iteration algorithms. Convergence property is developed to guarantee that the iterative performance index function will converge to the optimum. Neural networks are used to approximate the iterative performance index function and compute the iterative control policy, respectively, to implement the iterative ADP algorithm. Finally, a simulation example is given to illustrate the performance of the developed algorithm.

引用

页码：697 / 706

页数：9

共 50 条

[1] Neuro-optimal tracking control for a class of discrete-time nonlinear systems via generalized value iteration adaptive dynamic programming approach
Wei, Qinglai
Liu, Derong
Xu, Yancai
SOFT COMPUTING, 2016, 20 (02) : 697 - 706
[2] Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach
Wang, Ding
Liu, Derong
Wei, Qinglai
NEUROCOMPUTING, 2012, 78 (01) : 14 - 22
[3] Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems
Wei, Qinglai
Liu, Derong
Lin, Hanquan
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (03) : 840 - 853
[4] Optimal Learning Control for Discrete-Time Nonlinear Systems Using Generalized Policy Iteration Based Adaptive Dynamic Programming
Wei, Qinglai
Liu, Derong
2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 1781 - 1786
[5] Generalized Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems
Liu, Derong
Wei, Qinglai
Yan, Pengfei
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, 45 (12): : 1577 - 1591
[6] Neuro-Optimal Trajectory Tracking With Value Iteration of Discrete-Time Nonlinear Dynamics
Wang, Ding
Ha, Mingming
Cheng, Long
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4237 - 4248
[7] Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems
Liu, Derong
Wei, Qinglai
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (03) : 621 - 634
[8] Discrete-Time Optimal Control via Local Policy Iteration Adaptive Dynamic Programming
Wei, Qinglai
Liu, Derong
Lin, Qiao
Song, Ruizhuo
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (10) : 3367 - 3379
[9] A Novel Iterative θ-Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems
Wei, Qinglai
Liu, Derong
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, 11 (04) : 1176 - 1190
[10] Optimal Tracking Control for a Class of Nonlinear Discrete-Time Systems with Time Delays Based on Heuristic Dynamic Programming
Zhang, Huaguang
Song, Ruizhuo
Wei, Qinglai
Zhang, Tieyan
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (12): : 1851 - 1862

← 1 2 3 4 5 →