Adaptive optimal control of unknown discrete-time linear systems with guaranteed prescribed degree of stability using reinforcement learning

被引:0
作者
Seyed Ehsan Razavi
Mohammad Amin Moradi
Saeed Shamaghdari
Mohammad Bagher Menhaj
机构
[1] Amirkabir University of Technology,School of Electrical Engineering
[2] Iran University of Science and Technology,School of Electrical Engineering
来源
International Journal of Dynamics and Control | 2022年 / 10卷
关键词
Model-free optimal control; Reinforcement learning; Policy iteration; Convergence rate; Degree of stability;
D O I
暂无
中图分类号
学科分类号
摘要
This paper proposes a model-free solution for solving the optimal regulation problem for a discrete-time linear time-invariant system that unlike previous methods, presents a guaranteed convergence rate of the state variables as is needed in a group of problems. Initially, the Linear Quadratic Regulation problem (LQR) with a guaranteed convergence rate of the state is formulated for a system with known dynamics and the associated Riccati equation is derived. Solving the Riccati equation and finding the state feedback gain requires full knowledge of the dynamics of the system. To overcome this problem, the Policy Iteration (PI) Reinforcement Learning (RL) algorithm is formulated to solve the LQR problem with a guaranteed convergence rate, and the optimal state feedback gain is derived without having any knowledge about the dynamics of the system and only through the measurement of the states of the system. Eventually, the validity of the results is shown through simulation.
引用
收藏
页码:870 / 878
页数:8
相关论文
共 30 条
[1]  
Kleinman D(1968)On an iterative technique for Riccati equation computations IEEE Trans Autom Control 13 114-115
[2]  
Bhasin S(2011)Asymptotic tracking by a reinforcement learning-based adaptive critic controller J Control Theory Appl 9 400-409
[3]  
Vamvoudakis KG(2017)Game theory-based control system algorithms with real-time reinforcement learning: how to solve multiplayer games online IEEE Control Syst Mag 37 33-52
[4]  
Kiumarsi B(2018)Optimal and autonomous control using reinforcement learning: a survey IEEE Trans Neural Netw Learn Syst 29 2042-2062
[5]  
Gao W(2016)Adaptive dynamic programming and adaptive optimal output regulation of linear systems IEEE Trans Autom Control 61 4164-4169
[6]  
Jiang Z-P(2019)Output-feedback H∞ quadratic tracking control of linear systems using reinforcement learning Int J Adapt Control Signal Process 33 300-314
[7]  
Moghadam R(2015)${H} _ {{\infty}} $ Tracking control of completely unknown continuous-time systems via off-policy reinforcement learning IEEE Trans Neural Netw Learn Syst 26 2550-2562
[8]  
Lewis FL(2017)Data-driven optimal consensus control for discrete-time multi-agent systems with unknown dynamics using reinforcement learning method IEEE Trans Industr Electron 64 4091-4100
[9]  
Modares H(2016)Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning Automatica 71 334-341
[10]  
Zhang H(1988)Learning to predict by the methods of temporal differences Mach Learn 3 9-44