Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics

被引:840
作者
Jiang, Yu [1 ]
Jiang, Zhong-Ping [1 ]
机构
[1] NYU, Dept Elect & Comp Engn, Polytech Inst, Brooklyn, NY 11201 USA
基金
美国国家科学基金会;
关键词
Adaptive optimal control; Policy iterations; Linear-quadratic regulator (LQR); ZERO-SUM GAMES; FEEDBACK-CONTROL;
D O I
10.1016/j.automatica.2012.06.096
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel policy iteration approach for finding online adaptive optimal controllers for continuous-time linear systems with completely unknown system dynamics. The proposed approach employs the approximate/adaptive dynamic programming technique to iteratively solve the algebraic Riccati equation using the online information of state and input, without requiring the a priori knowledge of the system matrices. In addition, all iterations can be conducted by using repeatedly the same state and input information on some fixed time intervals. A practical online algorithm is developed in this paper, and is applied to the controller design for a turbocharged diesel engine with exhaust gas recirculation. Finally, several aspects of future work are discussed. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2699 / 2704
页数:6
相关论文
共 35 条
[1]   Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control [J].
Al-Tamimi, Asma ;
Lewis, Frank L. ;
Abu-Khalaf, Murad .
AUTOMATICA, 2007, 43 (03) :473-481
[2]  
[Anonymous], 1995, NONLINEAR ADAPTIVE C
[3]  
[Anonymous], 1989, THESIS CAMBRIDGE U
[4]  
BAIRD LC, 1994, 1994 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOL 1-7, P2448, DOI 10.1109/ICNN.1994.374604
[5]   Asymptotic tracking by a reinforcement learning-based adaptive critic controller [J].
Bhasin S. ;
Sharma N. ;
Patre P. ;
Dixon W. .
Journal of Control Theory and Applications, 2011, 9 (3) :400-409
[6]  
BRADTKE SJ, 1994, PROCEEDINGS OF THE 1994 AMERICAN CONTROL CONFERENCE, VOLS 1-3, P3475
[7]   Online optimal control of nonlinear discrete-time systems using approximate dynamic programming [J].
Dierks T. ;
Jagannathan S. .
Journal of Control Theory and Applications, 2011, 9 (3) :361-369
[8]   Reinforcement learning in continuous time and space [J].
Doya, K .
NEURAL COMPUTATION, 2000, 12 (01) :219-245
[9]   Adaptive feedback control by constrained approximate dynamic programming [J].
Ferrari, Silvia ;
Steck, James E. ;
Chandramohan, Rajeev .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :982-987
[10]  
Jiang Y, 2011, IEEE DECIS CONTR P, P115, DOI 10.1109/CDC.2011.6160279