Game Theoretical Reinforcement Learning for Robust H∞ Tracking Control of Discrete-Time Linear Systems with Unknown Dynamics

被引：0

作者：

Wu, Hao ^{[1
]}

Li, Shaobao ^{[1
]}

Durdevic, Petar ^{[2
]}

Yang, Zhenyu ^{[2
]}

机构：

[1] Yanshan Univ, Sch Elect Engn, Qinhuangdao 066004, Hebei, Peoples R China

[2] Aalborg Univ, Dept Energy Technol, DK-6700 Aalborg, Denmark

来源：

2021 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2021) | 2021年

关键词：

H-infinity control; game theory; reinforcement learning; zero-sum game; de-oiling hydrocyclone system;

D O I：

10.1109/ICoIAS53694.2021.00058

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Robust H-infinity control has been widely studied to improve control performance for industrial process control systems against disturbances. However, most of existing robust H-infinity control are model-based, and their deployment in some industrial facilities may greatly increase the installation and maintenance costs due to requiring system identification. Towards this end, a model-free robust H-infinity tracking control scheme is developed based on game theoretical reinforcement learning (RL) for discrete-time linear systems with unknown dynamics. The normal robust H-infinity tracking control problem is first modeled as a two-player zero-sum game with the controller and disturbance as the two players. A model-based solution by solving game discrete-time differential Riccati equation (GDARE) is introduced to show the solvability of the robust H-infinity tracking control problem, and then a novel off-policy RL algorithm is developed to replace the GDARE method for model-free robust H-infinity tracking control of the discrete-time linear systems with unknown dynamics. Stability of the learning algorithm is analyzed. Finally, a simulation study upon a de-oiling hydrocyclone system is conducted to demonstrate the effectiveness of the proposed algorithm.

引用

页码：290 / 295

页数：6

共 50 条

[1] Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics
Kiumarsi, Bahare
Lewis, Frank L.
Modares, Hamidreza
Karimpour, Ali
Naghibi-Sistani, Mohammad-Bagher
AUTOMATICA, 2014, 50 (04) : 1167 - 1175
[2] H∞ tracking control for linear discrete-time systems via reinforcement learning
Liu, Ying-Ying
Wang, Zhan-Shan
Shi, Zhan
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2020, 30 (01) : 282 - 301
[3] H∞ Optimal Control of Unknown Linear Discrete-time Systems: An Off-policy Reinforcement Learning Approach
Kiumarsi, Bahare
Modares, Hamidreza
Lewis, Frank L.
Jiang, Zhong-Ping
PROCEEDINGS OF THE 2015 7TH IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) AND ROBOTICS, AUTOMATION AND MECHATRONICS (RAM), 2015, : 41 - 46
[4] Optimal Output Regulation of Linear Discrete-Time Systems With Unknown Dynamics Using Reinforcement Learning
Jiang, Yi
Kiumarsi, Bahare
Fan, Jialu
Chai, Tianyou
Li, Jinna
Lewis, Frank L.
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (07) : 3147 - 3156
[5] Optimal Tracking Control for Linear Discrete-time Systems Using Reinforcement Learning
Kiumarsi-Khomartash, Bahare
Lewis, Frank L.
Naghibi-Sistani, Mohammad-Bagher
Karimpour, Ali
2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 3845 - 3850
[6] H∞ control of linear discrete-time systems: Off-policy reinforcement learning
Kiumarsi, Bahare
Lewis, Frank L.
Jiang, Zhong-Ping
AUTOMATICA, 2017, 78 : 144 - 152
[7] Off-policy Reinforcement Learning for Robust Control of Discrete-time Uncertain Linear Systems
Yang, Yongliang
Guo, Zhishan
Wunsch, Donald
Yin, Yixin
PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 2507 - 2512
[8] Scaling policy iteration based reinforcement learning for unknown discrete-time linear systems
Pang, Zhen
Tang, Shengda
Cheng, Jun
He, Shuping
AUTOMATICA, 2025, 176
[9] Robust H8 tracking of linear discrete-time systems using Q-learning
Valadbeigi, Amir Parviz
Shu, Zhan
Khaki Sedigh, Ali
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (10) : 5604 - 5623
[10] Reinforcement Q-learning algorithm for H∞ tracking control of discrete-time Markov jump systems
Shi, Jiahui
He, Dakuo
Zhang, Qiang
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2025, 56 (03) : 502 - 523

← 1 2 3 4 5 →