Data-Based Optimal Tracking Control of Nonaffine Nonlinear Discrete-Time Systems

Cited by: 0
Authors
Luo, Biao [1]
Liu, Derong [2]
Huang, Tingwen [3]
Li, Chao [1]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
[3] Texas A&M Univ Qatar, POB 23874, Doha, Qatar
Source
NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV | 2016 / Vol. 9950
Keywords
Optimal tracking control; Data-based; Q-learning; Critic-only; H-INFINITY CONTROL; LINEAR-SYSTEMS; CONTROL SCHEME; APPROXIMATION; ITERATION;
DOI
10.1007/978-3-319-46681-1_68
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper considers the optimal tracking control problem for nonaffine nonlinear discrete-time systems. The problem depends on solving the so-called tracking Hamilton-Jacobi-Bellman equation, which is extremely difficult to solve even for simple systems. To overcome this difficulty, a data-based Q-learning algorithm is proposed that learns the optimal tracking control policy from data of the practical system. For implementation, a critic-only neural network structure is developed, in which only a critic neural network is required to estimate the Q-function, and a least-squares scheme is employed to update the network weights.
Pages: 573-581
Number of pages: 9
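The abstract describes a critic-only scheme in which a Q-function is estimated from system data and the critic weights are updated by least squares. The following is a minimal sketch of that general idea, not the authors' implementation. Everything specific in it is an assumption made for illustration: the scalar plant `plant`, the constant reference in `reference`, the quadratic basis in `features` (a linear-in-weights stand-in for the critic network), the discount factor `GAMMA`, the control penalty `R_WEIGHT`, and the finite grid of candidate controls used for the minimization.

```python
import numpy as np

GAMMA = 0.9      # discount factor (assumption, not taken from the record)
R_WEIGHT = 0.1   # control penalty in the stage cost (assumption)


def plant(x, u):
    """Hypothetical nonaffine plant; used only to generate data."""
    return 0.8 * x + np.tanh(u)


def reference(r):
    """Hypothetical reference generator: a constant set point."""
    return r


def features(x, r, u):
    """Assumed quadratic critic basis in (e, r, u), with e = x - r."""
    x, r, u = np.broadcast_arrays(x, r, u)
    e = x - r
    return np.stack([e * e, e * r, e * u, r * r, r * u, u * u], axis=-1)


def sample_data(rng, n):
    """Draw n random (x, r, u) samples and record the resulting next values."""
    x = rng.uniform(-2.0, 2.0, n)
    r = rng.uniform(-1.0, 1.0, n)
    u = rng.uniform(-2.0, 2.0, n)
    return x, r, u, plant(x, u), reference(r)


def fit_critic(data, u_grid, n_iter=30):
    """Iterate least-squares fits of the critic weights on a fixed data batch."""
    x, r, u, x_next, r_next = data
    phi = features(x, r, u)                                   # shape (N, 6)
    phi_next = features(x_next[:, None], r_next[:, None],
                        u_grid[None, :])                      # shape (N, M, 6)
    cost = (x - r) ** 2 + R_WEIGHT * u ** 2                   # stage cost
    w = np.zeros(phi.shape[1])
    for _ in range(n_iter):
        # Minimize the current Q estimate over the candidate controls.
        q_next = np.min(phi_next @ w, axis=1)
        target = cost + GAMMA * q_next
        # Least-squares update of the critic weights.
        w, *_ = np.linalg.lstsq(phi, target, rcond=None)
    return w


def greedy_control(w, x, r, u_grid):
    """Pick the candidate control with the smallest estimated Q-value."""
    q = features(x, r, u_grid) @ w
    return float(u_grid[np.argmin(q)])


rng = np.random.default_rng(0)
u_grid = np.linspace(-2.0, 2.0, 41)
w = fit_critic(sample_data(rng, 500), u_grid)
print("critic weights:", w)
print("control at x=1.5, r=0:", greedy_control(w, 1.5, 0.0, u_grid))
```

The plant model is called only inside `sample_data`, so the fitting step sees nothing but recorded tuples, which is the sense in which such a scheme is data-based. No separate actor is trained here: the control is read off the critic by minimizing over the grid.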