Game Theoretical Reinforcement Learning for Robust H∞ Tracking Control of Discrete-Time Linear Systems with Unknown Dynamics

被引:0
|
作者
Wu, Hao [1 ]
Li, Shaobao [1 ]
Durdevic, Petar [2 ]
Yang, Zhenyu [2 ]
机构
[1] Yanshan Univ, Sch Elect Engn, Qinhuangdao 066004, Hebei, Peoples R China
[2] Aalborg Univ, Dept Energy Technol, DK-6700 Aalborg, Denmark
来源
2021 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2021) | 2021年
关键词
H-infinity control; game theory; reinforcement learning; zero-sum game; de-oiling hydrocyclone system;
D O I
10.1109/ICoIAS53694.2021.00058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Robust H-infinity control has been widely studied to improve control performance for industrial process control systems against disturbances. However, most of existing robust H-infinity control are model-based, and their deployment in some industrial facilities may greatly increase the installation and maintenance costs due to requiring system identification. Towards this end, a model-free robust H-infinity tracking control scheme is developed based on game theoretical reinforcement learning (RL) for discrete-time linear systems with unknown dynamics. The normal robust H-infinity tracking control problem is first modeled as a two-player zero-sum game with the controller and disturbance as the two players. A model-based solution by solving game discrete-time differential Riccati equation (GDARE) is introduced to show the solvability of the robust H-infinity tracking control problem, and then a novel off-policy RL algorithm is developed to replace the GDARE method for model-free robust H-infinity tracking control of the discrete-time linear systems with unknown dynamics. Stability of the learning algorithm is analyzed. Finally, a simulation study upon a de-oiling hydrocyclone system is conducted to demonstrate the effectiveness of the proposed algorithm.
引用
收藏
页码:290 / 295
页数:6
相关论文
共 50 条
  • [1] Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics
    Kiumarsi, Bahare
    Lewis, Frank L.
    Modares, Hamidreza
    Karimpour, Ali
    Naghibi-Sistani, Mohammad-Bagher
    AUTOMATICA, 2014, 50 (04) : 1167 - 1175
  • [2] H∞ tracking control for linear discrete-time systems via reinforcement learning
    Liu, Ying-Ying
    Wang, Zhan-Shan
    Shi, Zhan
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2020, 30 (01) : 282 - 301
  • [3] H∞ Optimal Control of Unknown Linear Discrete-time Systems: An Off-policy Reinforcement Learning Approach
    Kiumarsi, Bahare
    Modares, Hamidreza
    Lewis, Frank L.
    Jiang, Zhong-Ping
    PROCEEDINGS OF THE 2015 7TH IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) AND ROBOTICS, AUTOMATION AND MECHATRONICS (RAM), 2015, : 41 - 46
  • [4] Optimal Output Regulation of Linear Discrete-Time Systems With Unknown Dynamics Using Reinforcement Learning
    Jiang, Yi
    Kiumarsi, Bahare
    Fan, Jialu
    Chai, Tianyou
    Li, Jinna
    Lewis, Frank L.
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (07) : 3147 - 3156
  • [5] Optimal Tracking Control for Linear Discrete-time Systems Using Reinforcement Learning
    Kiumarsi-Khomartash, Bahare
    Lewis, Frank L.
    Naghibi-Sistani, Mohammad-Bagher
    Karimpour, Ali
    2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 3845 - 3850
  • [6] H∞ control of linear discrete-time systems: Off-policy reinforcement learning
    Kiumarsi, Bahare
    Lewis, Frank L.
    Jiang, Zhong-Ping
    AUTOMATICA, 2017, 78 : 144 - 152
  • [7] Off-policy Reinforcement Learning for Robust Control of Discrete-time Uncertain Linear Systems
    Yang, Yongliang
    Guo, Zhishan
    Wunsch, Donald
    Yin, Yixin
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 2507 - 2512
  • [8] Scaling policy iteration based reinforcement learning for unknown discrete-time linear systems
    Pang, Zhen
    Tang, Shengda
    Cheng, Jun
    He, Shuping
    AUTOMATICA, 2025, 176
  • [9] Robust H8 tracking of linear discrete-time systems using Q-learning
    Valadbeigi, Amir Parviz
    Shu, Zhan
    Khaki Sedigh, Ali
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (10) : 5604 - 5623
  • [10] Reinforcement Q-learning algorithm for H∞ tracking control of discrete-time Markov jump systems
    Shi, Jiahui
    He, Dakuo
    Zhang, Qiang
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2025, 56 (03) : 502 - 523