Reinforcement Q-learning algorithm for H∞ tracking control of discrete-time Markov jump systems

Authors
Shi, Jiahui [1]
He, Dakuo [1, 2]
Zhang, Qiang [1]
Affiliations
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Peoples R China
[2] State Key Lab Synthet Automat Proc Ind, Shenyang, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Markov jump systems; reinforcement learning; H-infinity tracking control; tracking game algebraic Riccati equation; NONLINEAR-SYSTEMS; DESIGN
DOI
10.1080/00207721.2024.2395928
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, the H-infinity tracking control problem for linear discrete-time Markov jump systems is studied using a data-based reinforcement learning method. Specifically, a new performance index function is established using the Markov chain and a weighted-sum technique, which yields a tracking game algebraic Riccati equation with a weight vector and a discount factor. A Q-learning algorithm is proposed to solve the tracking game algebraic Riccati equation online without knowledge of the system model. In addition, a convergence analysis of the algorithm is given, and it is proved that the added probing noise does not bias the algorithm. Finally, two simulation examples verify the effectiveness of the proposed algorithm.
Pages: 502-523
Number of pages: 22