Finite-horizon Q-learning for discrete-time zero-sum games with application to H∞$$ {H}_{\infty } $$ control

被引:1
|
作者
Liu, Mingxiang [1 ,2 ]
Cai, Qianqian [1 ,2 ,4 ]
Meng, Wei [1 ,2 ]
Li, Dandan [1 ,2 ]
Fu, Minyue [3 ]
机构
[1] Guangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
[2] Guangdong Prov Key Lab Intelligent Decis & Coopera, Guangzhou, Peoples R China
[3] Southern Univ Sci & Technol, Dept Mech & Energy Engn, Shenzhen, Peoples R China
[4] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
finite-horizon; H(infinity)control; linear quadratic (LQ) control; Q-learning; zero-sum games; SYSTEMS;
D O I
10.1002/asjc.3027
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we investigate the optimal control strategies for model-free zero-sum games involving the H(infinity )control. The key contribution is the development of a Q-learning algorithm for linear quadratic games without knowing the system dynamics. The finite-horizon setting is more practical than the infinite-horizon setting, but it is difficult to solve the time-varying Riccati equation associated with the finite-horizon setting directly. The proposed algorithm is shown to solve the time-varying Riccati equation iteratively without the use of models, and numerical experiments on aircraft dynamics demonstrate the algorithm's efficiency.
引用
收藏
页码:3160 / 3168
页数:9
相关论文
共 50 条
  • [1] Output feedback Q-learning for discrete-time finite-horizon zero-sum games with application to the H? control
    Liu, Mingxiang
    Cai, Qianqian
    Li, Dandan
    Meng, Wei
    Fu, Minyue
    NEUROCOMPUTING, 2023, 529 : 48 - 55
  • [2] Output feedback Q-learning for discrete-time linear zero-sum games with application to the H-infinity control
    Rizvi, Syed Ali Asad
    Lin, Zongli
    AUTOMATICA, 2018, 95 : 213 - 221
  • [3] Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
    Al-Tamimi, Asma
    Lewis, Frank L.
    Abu-Khalaf, Murad
    AUTOMATICA, 2007, 43 (03) : 473 - 481
  • [4] Adaptive critic designs for discrete-time zero-sum games with application to H∞ control
    Al-Tamimi, Asma
    Abu-Khalaf, Murad
    Lewis, Frank L.
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (01): : 240 - 247
  • [5] Model-free finite-horizon optimal control of discrete-time two-player zero-sum games
    Wang, Wei
    Chen, Xin
    Du, Jianhua
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2023, 54 (01) : 167 - 179
  • [6] Neural Q-learning for discrete-time nonlinear zero-sum games with adjustable convergence rate
    Wang, Yuan
    Wang, Ding
    Zhao, Mingming
    Liu, Nan
    Qiao, Junfei
    NEURAL NETWORKS, 2024, 175
  • [7] H∞ Control for Discrete-time Linear Systems by Integrating Off-policy Q-learning and Zero-sum Game
    Li, Jinna
    Ding, Zhengtao
    Yang, Chunyu
    Niu, Hong
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2018, : 817 - 822
  • [8] H∞ Consensus for Discrete-Time Fractional-Order Multi-Agent Systems With Disturbance via Q-Learning in Zero-Sum Games
    An, Chunlan
    Su, Housheng
    Chen, Shiming
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (04): : 2803 - 2814
  • [9] Output Feedback Q-Learning for Linear-Quadratic Discrete-Time Finite-Horizon Control Problems
    Calafiore, Giuseppe C.
    Possieri, Corrado
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (07) : 3274 - 3281
  • [10] Stochastic Zero-Sum Differential Games and H∞ Control of Discrete-time Markov Jump Systems
    Zhou Haiying
    Zhu Huainian
    Zhang Chengke
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 151 - 156