Optimal Trajectory Output Tracking Control with a Q-learning Algorithm

被引:0
作者
Vamvoudakis, Kyriakos G. [1 ]
机构
[1] Univ Calif Santa Barbara, Ctr Control Dynam Syst & Computat CCDC, Santa Barbara, CA 93106 USA
来源
2016 AMERICAN CONTROL CONFERENCE (ACC) | 2016年
关键词
Q-learning; output trajectory tracking; uncertain systems; TIME LINEAR-SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper a novel Q-learning algorithm is proposed to solve the Linear Quadratic Output Tracking (LQOT) control problem of a linear time invariant system with completely unknown system and reference dynamics. We first define an action-dependent value function for the LQOT problem after we augment the system and the reference states and pick appropriately the user-defined matrices in the performance index of the augmented state. An integral reinforcement learning approach is used to develop a reinforcement learning structure to estimate the parameters of the Q-function online while also guaranteeing closed-loop stability, trajectory tracking and convergence to the optimal tracking solution. A simulation result of an unknown spring-mass-damper linear system is presented to show the efficacy of the proposed approach.
引用
收藏
页码:5752 / 5757
页数:6
相关论文
共 28 条
[1]  
Anderson B. D., 2007, OPTIMAL CONTROL LINE
[2]  
[Anonymous], 2007, WILEY SERIES PROBABI
[3]  
[Anonymous], 2008, P 25 INT C MACHINE L
[4]  
Bertsekas D. P., 1996, NEURODYNAMIC PROGRAM
[5]  
Busoniu L, 2010, AUTOM CONTROL ENG SE, P1, DOI 10.1201/9781439821091-f
[6]  
Gao WN, 2015, P AMER CONTR CONF, P4929, DOI 10.1109/ACC.2015.7172106
[7]  
Ioannou P., 2006, Advances in design and control
[8]   Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics [J].
Jiang, Yu ;
Jiang, Zhong-Ping .
AUTOMATICA, 2012, 48 (10) :2699-2704
[9]   Approximate optimal trajectory tracking for continuous-time nonlinear systems [J].
Kamalapurkar, Rushikesh ;
Dinh, Huyen ;
Bhasin, Shubhendu ;
Dixon, Warren E. .
AUTOMATICA, 2015, 51 :40-48
[10]  
Kiumarsi B., IEEE T CYBE IN PRESS