H∞ Tracking learning control for discrete-time Markov jump systems: A parallel off-policy reinforcement learning

被引:1
|
作者
Zhang, Xuewen [1 ]
Xia, Jianwei [2 ]
Wang, Jing [1 ]
Chen, Xiangyong [3 ]
Shen, Hao [1 ]
机构
[1] Anhui Univ Technol, Sch Elect & Informat Engn, China Int Sci & Technol Cooperat Base Intelligent, Maanshan 243002, Peoples R China
[2] Liaocheng Univ, Sch Math Sci, Maanshan 252059, Peoples R China
[3] Linyi Univ, Sch Automat & Elect Engn, Linyi 276005, Peoples R China
来源
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS | 2023年 / 360卷 / 18期
基金
中国国家自然科学基金;
关键词
FEEDBACK-CONTROL; LINEAR-SYSTEMS; DESIGN;
D O I
10.1016/j.jfranklin.2023.10.008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper deals with the H infinity tracking control problem for a class of linear discrete-time Markov jump systems, in which the knowledge of system dynamics is not required. First, combined with reinforcement learning, a novel Bellman equation and the augmented coupled game algebraic Riccati equation are presented to derived the optimal control policy for the augmented discrete-time Markov jump systems. Moreover, based on the augmented system, a newly constructed system is given to collect the input and output data, which solves the problem that the coupling term in the discrete-time Markov jump systems is difficult to solve. Subsequently, a novel model-free algorithm is designed that does not need the dynamic information of the original system. Finally, a numerical example is given to verify the effectiveness of the proposed approach.
引用
收藏
页码:14878 / 14890
页数:13
相关论文
共 50 条
  • [41] Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning
    Yang, Xiong
    Liu, Derong
    Wang, Ding
    Wei, Qinglai
    NEURAL NETWORKS, 2014, 55 : 30 - 41
  • [42] Self-triggered neural tracking control for discrete-time nonlinear systems via adaptive critic learning
    Hu, Lingzhi
    Wang, Ding
    Wang, Gongming
    Qiao, Junfei
    NEURAL NETWORKS, 2025, 186
  • [43] Asynchronous Dissipative Control for a Class of Discrete-time Singular Markov Jump Systems
    Lu, Xiao
    Yan, Jiaqiang
    Wang, Haixia
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 176 - 179
  • [44] Asynchronous Static Output Feedback Control of Discrete-time Markov Jump Systems
    Dong, Shanling
    Wu, Zheng-Guang
    IECON 2018 - 44TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2018, : 5957 - 5962
  • [45] Resilient Asynchronous H∞ Control for Discrete-Time Markov Jump Singularly Perturbed Systems Based on Hidden Markov Model
    Li, Feng
    Xu, Shengyuan
    Zhang, Baoyong
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (08): : 2860 - 2869
  • [46] Mixed H2/H8 control for discrete-time periodic Markov jump systems with quantization effects and packet loss compensation
    Hua, Mingang
    Zhang, Fan
    Deng, Feiqi
    Fei, Juntao
    Chen, Hua
    NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2023, 50
  • [47] Linear Quadratic Optimal Control for Discrete-time Markov Jump Linear Systems
    Han, Chunyan
    Li, Hongdan
    Wang, Wei
    Zhang, Huanshui
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2018, : 769 - 774
  • [48] Control for Discrete-time Fuzzy Markov Jump Systems with Mode-dependent Antecedent Parts
    Zhan, Lixian
    Yang, Ting
    Wu, Fen
    2014 IEEE 23RD INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2014, : 2306 - 2311
  • [49] Cooperative Output Regulation Quadratic Control for Discrete-Time Heterogeneous Multiagent Markov Jump Systems
    Dong, Shanling
    Liu, Lu
    Feng, Gang
    Liu, Meiqin
    Wu, Zheng-Guang
    Zheng, Ronghao
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) : 9882 - 9892
  • [50] Stochastic H2 Optimal Control of Discrete-Time Markov Jump Systems with Periodic Coefficients
    Ma, Hongji
    Jia, Yingmin
    Du, Junping
    Yu, Fashan
    2012 AMERICAN CONTROL CONFERENCE (ACC), 2012, : 1640 - 1645