Finite-horizon Q-learning for discrete-time zero-sum games with application to H∞$$ {H}_{\infty } $$ control

被引：1

作者：

Liu, Mingxiang ^{[1
,2
]}

Cai, Qianqian ^{[1
,2
,4
]}

Meng, Wei ^{[1
,2
]}

Li, Dandan ^{[1
,2
]}

Fu, Minyue ^{[3
]}

机构：

[1] Guangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China

[2] Guangdong Prov Key Lab Intelligent Decis & Coopera, Guangzhou, Peoples R China

[3] Southern Univ Sci & Technol, Dept Mech & Energy Engn, Shenzhen, Peoples R China

[4] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Guangdong, Peoples R China

来源：

ASIAN JOURNAL OF CONTROL | 2023年 / 25卷 / 04期

基金：

中国国家自然科学基金;

关键词：

finite-horizon; H(infinity)control; linear quadratic (LQ) control; Q-learning; zero-sum games; SYSTEMS;

D O I：

10.1002/asjc.3027

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we investigate the optimal control strategies for model-free zero-sum games involving the H(infinity )control. The key contribution is the development of a Q-learning algorithm for linear quadratic games without knowing the system dynamics. The finite-horizon setting is more practical than the infinite-horizon setting, but it is difficult to solve the time-varying Riccati equation associated with the finite-horizon setting directly. The proposed algorithm is shown to solve the time-varying Riccati equation iteratively without the use of models, and numerical experiments on aircraft dynamics demonstrate the algorithm's efficiency.

引用

页码：3160 / 3168

页数：9

共 50 条

[21] Zero-Sum Games for Finite-Horizon Semi-Markov Processes Under the Probability Criterion
Huang, Xiangxiang
Guo, Xianping
Wen, Xin
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (09) : 5560 - 5567
[22] Sufficient Conditions for Optimality in Finite-Horizon Two-Player Zero-Sum Hybrid Games
Leudo, Santiago J.
Sanfelice, Ricardo G.
2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 3268 - 3273
[23] Adaptive Critic Control of Linear Discrete-Time Zero-Sum Games with Stability Guarantee
Ren, Jin
Wang, Ding
Li, Menghua
2024 5th International Conference on Artificial Intelligence and Electromechanical Automation, AIEA 2024, 2024, : 988 - 992
[24] Minimax Q-learning design for H∞ control of linear discrete-time systems
Li, Xinxing
Xi, Lele
Zha, Wenzhong
Peng, Zhihong
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 23 (03) : 438 - 451
[25] Discrete-time zero-sum Markov games with first passage criteria
Liu, Qiuli
Huang, Xiangxiang
OPTIMIZATION, 2017, 66 (04) : 571 - 587
[26] H∞$$ {H}_{\infty } $$ constraint Pareto optimal control for discrete-time Markov jump linear stochastic systems in finite horizon
Jiang, Xiushan
Cui, Kai
Zhao, Dongya
ASIAN JOURNAL OF CONTROL, 2023, 25 (04) : 2893 - 2907
[27] Discrete-Time H∞ Preview Control Problem in Finite Horizon
Wang, Hongxia
Zhang, Huanshui
Xie, Lihua
MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
[28] Continuous-time zero-sum games for Markov chains with risk-sensitive finite-horizon cost criterion
Golui, Subrata
Pal, Chandan
STOCHASTIC ANALYSIS AND APPLICATIONS, 2022, 40 (01) : 78 - 95
[29] Dichotomy value iteration with parallel learning design towards discrete-time zero-sum games
Wang, Jiangyu
Wang, Ding
Li, Xin
Qiao, Junfei
NEURAL NETWORKS, 2023, 167 : 751 - 762
[30] Event-triggered optimal control scheme for discrete-time nonlinear zero-sum games
Zhang X.
Bo Y.-C.
Cui L.-L.
Zhang, Xin (zhangxin@upc.edu.cn), 2018, South China University of Technology (35): : 619 - 626

← 1 2 3 4 5 →