A New Optimal Control Method for Discrete-Time Nonlinear Systems with Approximation Error

Citations: 0
Authors
Wei, Qinglai [1]
Liu, Derong [1]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
Source
PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012) | 2012
Funding
Beijing Natural Science Foundation; China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
Adaptive dynamic programming; approximate dynamic programming; discrete-time systems; optimal control; convergence conditions; control scheme
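DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract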
In this paper, a new optimal control method is proposed for discrete-time nonlinear systems, based on an iterative adaptive dynamic programming (ADP) algorithm with approximation error. In each iteration of the proposed algorithm, the iterative control law and the iterative performance index function cannot be obtained accurately. Convergence conditions for the iterative ADP algorithm are presented. Under these conditions, the iterative performance index functions are proved to converge to a small neighborhood of the optimal performance index function. Finally, a simulation example is given to illustrate the performance of the proposed method.
Pages: 185-190
Page count: 6