Finite-horizon Q-learning for discrete-time zero-sum games with application to H∞$$ {H}_{\infty } $$ control

被引:1
|
作者
Liu, Mingxiang [1 ,2 ]
Cai, Qianqian [1 ,2 ,4 ]
Meng, Wei [1 ,2 ]
Li, Dandan [1 ,2 ]
Fu, Minyue [3 ]
机构
[1] Guangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
[2] Guangdong Prov Key Lab Intelligent Decis & Coopera, Guangzhou, Peoples R China
[3] Southern Univ Sci & Technol, Dept Mech & Energy Engn, Shenzhen, Peoples R China
[4] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
finite-horizon; H(infinity)control; linear quadratic (LQ) control; Q-learning; zero-sum games; SYSTEMS;
D O I
10.1002/asjc.3027
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we investigate the optimal control strategies for model-free zero-sum games involving the H(infinity )control. The key contribution is the development of a Q-learning algorithm for linear quadratic games without knowing the system dynamics. The finite-horizon setting is more practical than the infinite-horizon setting, but it is difficult to solve the time-varying Riccati equation associated with the finite-horizon setting directly. The proposed algorithm is shown to solve the time-varying Riccati equation iteratively without the use of models, and numerical experiments on aircraft dynamics demonstrate the algorithm's efficiency.
引用
收藏
页码:3160 / 3168
页数:9
相关论文
共 50 条
  • [21] Zero-Sum Games for Finite-Horizon Semi-Markov Processes Under the Probability Criterion
    Huang, Xiangxiang
    Guo, Xianping
    Wen, Xin
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (09) : 5560 - 5567
  • [22] Sufficient Conditions for Optimality in Finite-Horizon Two-Player Zero-Sum Hybrid Games
    Leudo, Santiago J.
    Sanfelice, Ricardo G.
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 3268 - 3273
  • [23] Adaptive Critic Control of Linear Discrete-Time Zero-Sum Games with Stability Guarantee
    Ren, Jin
    Wang, Ding
    Li, Menghua
    2024 5th International Conference on Artificial Intelligence and Electromechanical Automation, AIEA 2024, 2024, : 988 - 992
  • [24] Minimax Q-learning design for H∞ control of linear discrete-time systems
    Li, Xinxing
    Xi, Lele
    Zha, Wenzhong
    Peng, Zhihong
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 23 (03) : 438 - 451
  • [25] Discrete-time zero-sum Markov games with first passage criteria
    Liu, Qiuli
    Huang, Xiangxiang
    OPTIMIZATION, 2017, 66 (04) : 571 - 587
  • [26] H∞$$ {H}_{\infty } $$ constraint Pareto optimal control for discrete-time Markov jump linear stochastic systems in finite horizon
    Jiang, Xiushan
    Cui, Kai
    Zhao, Dongya
    ASIAN JOURNAL OF CONTROL, 2023, 25 (04) : 2893 - 2907
  • [27] Discrete-Time H∞ Preview Control Problem in Finite Horizon
    Wang, Hongxia
    Zhang, Huanshui
    Xie, Lihua
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [28] Continuous-time zero-sum games for Markov chains with risk-sensitive finite-horizon cost criterion
    Golui, Subrata
    Pal, Chandan
    STOCHASTIC ANALYSIS AND APPLICATIONS, 2022, 40 (01) : 78 - 95
  • [29] Dichotomy value iteration with parallel learning design towards discrete-time zero-sum games
    Wang, Jiangyu
    Wang, Ding
    Li, Xin
    Qiao, Junfei
    NEURAL NETWORKS, 2023, 167 : 751 - 762
  • [30] Event-triggered optimal control scheme for discrete-time nonlinear zero-sum games
    Zhang X.
    Bo Y.-C.
    Cui L.-L.
    Zhang, Xin (zhangxin@upc.edu.cn), 2018, South China University of Technology (35): : 619 - 626