Optimal Trajectory Output Tracking Control with a Q-learning Algorithm

被引：0

作者：

Vamvoudakis, Kyriakos G. ^{[1
]}

机构：

[1] Univ Calif Santa Barbara, Ctr Control Dynam Syst & Computat CCDC, Santa Barbara, CA 93106 USA

来源：

2016 AMERICAN CONTROL CONFERENCE (ACC) | 2016年

关键词：

Q-learning; output trajectory tracking; uncertain systems; TIME LINEAR-SYSTEMS;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper a novel Q-learning algorithm is proposed to solve the Linear Quadratic Output Tracking (LQOT) control problem of a linear time invariant system with completely unknown system and reference dynamics. We first define an action-dependent value function for the LQOT problem after we augment the system and the reference states and pick appropriately the user-defined matrices in the performance index of the augmented state. An integral reinforcement learning approach is used to develop a reinforcement learning structure to estimate the parameters of the Q-function online while also guaranteeing closed-loop stability, trajectory tracking and convergence to the optimal tracking solution. A simulation result of an unknown spring-mass-damper linear system is presented to show the efficacy of the proposed approach.

引用

页码：5752 / 5757

页数：6

共 28 条

[1]

Anderson B. D., 2007, OPTIMAL CONTROL LINE

[2]

[Anonymous], 2007, WILEY SERIES PROBABI

[3]

[Anonymous], 2008, P 25 INT C MACHINE L

[4]

Bertsekas D. P., 1996, NEURODYNAMIC PROGRAM

[5]

Busoniu L, 2010, AUTOM CONTROL ENG SE, P1, DOI 10.1201/9781439821091-f

[6]

Gao WN, 2015, P AMER CONTR CONF, P4929, DOI 10.1109/ACC.2015.7172106

[7]

Ioannou P., 2006, Advances in design and control

[8] Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics [J].

Jiang, Yu ;

Jiang, Zhong-Ping .

AUTOMATICA, 2012, 48 (10) :2699-2704

[9] Approximate optimal trajectory tracking for continuous-time nonlinear systems [J].

Kamalapurkar, Rushikesh ;

Dinh, Huyen ;

Bhasin, Shubhendu ;

Dixon, Warren E. .

AUTOMATICA, 2015, 51 :40-48

[10]

Kiumarsi B., IEEE T CYBE IN PRESS

← 1 2 3 →