Q-learning Based Adaptive Optimal Control for Linear Quadratic Tracking Problem

被引:1
作者
Sharma, Shashi Kant [1 ]
Jha, Sumit Kumar [1 ]
Dhawan, Amit [1 ]
Tiwari, Manish [1 ]
机构
[1] MNNIT Allahabad, Dept Elect & Commun Engn, Prayagraj 211004, India
关键词
Adaptive optimal control; algebraic Riccati equation; linear quadratic tracking; Q-learning; CONTINUOUS-TIME SYSTEMS;
D O I
10.1007/s12555-022-0364-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes a Q-learning based algorithm to design the linear quadratic tracker (LQT) for linear time invariant (LTI) continuous-time systems with partially unknown dynamics. The proposed approach uses a fixed-point equation in terms of Q-function in order to estimate the unknown optimal gain parameters. The fixedpoint equation, which is derived by applying the Pontryagin's minimum principle in Q-learning, is based on the modified algebraic Riccati equation (ARE) for LQT problem. The online adaptation of the optimal parameters are achieved by using the gradient descent based parameter update laws by minimizing the Bellman's error term which is derived from fixed-point equation mentioned earlier. A persistence of excitation condition has been used to establish the desired optimal convergence of the estimated control parameters. Simulation results have been shown to validate the efficiency of the proposed Q-learning approach.
引用
收藏
页码:2718 / 2725
页数:8
相关论文
共 30 条