Controller Optimization for Multirate Systems Based on Reinforcement Learning

被引：0

作者：

Zhan Li

Sheng-Ri Xue

Xing-Hu Yu

Hui-Jun Gao

机构：

[1] Harbin Institute of Technology,Research Institute of Intelligent Control and Systems

[2] Harbin Institute of Technology,Ningbo Institute of Intelligent Equipment Technology

来源：

International Journal of Automation and Computing | 2020年 / 17卷

关键词：

Multirate system; reinforcement learning; policy iteration; optimal control; controller optimization;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

The goal of this paper is to design a model-free optimal controller for the multirate system based on reinforcement learning. Sampled-data control systems are widely used in the industrial production process and multirate sampling has attracted much attention in the study of the sampled-data control theory. In this paper, we assume the sampling periods for state variables are different from periods for system inputs. Under this condition, we can obtain an equivalent discrete-time system using the lifting technique. Then, we provide an algorithm to solve the linear quadratic regulator (LQR) control problem of multirate systems with the utilization of matrix substitutions. Based on a reinforcement learning method, we use online policy iteration and off-policy algorithms to optimize the controller for multirate systems. By using the least squares method, we convert the off-policy algorithm into a model-free reinforcement learning algorithm, which only requires the input and output data of the system. Finally, we use an example to illustrate the applicability and efficiency of the model-free algorithm above mentioned.

引用

页码：417 / 427

页数：10

共 133 条

[1] Shi P(1998)Filtering on sampled-data systems with parametric uncertainty IEEE Transactions on Automatic Control 43 1022-1027
[2] Han X J(2019)Sampled-data robust H∞ control for T-S fuzzy time-delay systems with state quantization International Journal of Control, Automation and Systems 17 46-56
[3] Ma Y C(2017)Control of uncertain sampled-data systems: An adaptive posicast control approach IEEE Transactions on Automatic Control 62 2597-2602
[4] Abidi K(2018)An observer based sampled-data control for class of scalar nonlinear systems using continualized discretization method International Journal of Control, Automation and Systems 16 709-716
[5] Yildiz Y(2018)Sampled-data fuzzy control of two-wheel inverted pendulums based on passivity theory International Journal of Control, Automation and Systems 16 2538-2648
[6] Annaswamy A(1959)A unified approach to the theory of sampling systems Journal of the Franklin Institute 267 405-436
[7] Nguyen-Van T(1990)A new class of shift-varying operators, their shift-invariant equivalents, and multirate digital systems IEEE Transactions on Automatic Control 35 429-433
[8] Liu R J(1994)H∞ design of general multirate sampled-data control systems Automatica 30 1139-1152
[9] Wu J F(1998)H∞ control of multirate sampled-data systems: A state-space approach Automatica 34 415-428
[10] Wang D(1998)Direct state space solution of multirate sampled-data H2 optimal control Automatica 34 1431-1437

← 1 2 3 4 5 6 7 8 9 10 →