Model-free finite-horizon optimal control of discrete-time two-player zero-sum games

被引：3

作者：

Wang, Wei ^{[1
,2
,3
]}

Chen, Xin ^{[2
,3
,4
]}

Du, Jianhua ^{[1
]}

机构：

[1] Zhongnan Univ Econ & Law, Sch Informat & Safety Engn, Wuhan, Peoples R China

[2] Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan, Peoples R China

[3] Minist Educ, Engn Res Ctr Intelligent Technol Geoexplorat, Wuhan, Peoples R China

[4] China Univ Geosci, Sch Automat, Wuhan, Peoples R China

来源：

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE | 2023年 / 54卷 / 01期

关键词：

Q-function; finite horizon; optimal control; zero-sum game; H-INFINITY-CONTROL; DIFFERENTIAL-GAMES; SYSTEMS;

D O I：

10.1080/00207721.2022.2111236

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Conventionally, as the system dynamics is known, the finite-horizon optimal control of zero-sum games relies on solving the time-varying Riccati equations. In this paper, with unknown system dynamics being considered, a Q-function-based finite-horizon control method is introduced to approximate the solutions of the time-varying Riccati equations. First, a time-varying Q-function explicitly dependent on the time-varying control and disturbance is defined. Then the defined time-varying Q-function is utilised to represent the time-varying control and disturbance which are equivalent to the solutions of the time-varying Riccati equations by relaxing the system dynamics. Finally, a model-free method is introduced to approximate the defined time-varying Q-function. Simulation studies are conducted to demonstrate the validity of the developed method.

引用

页码：167 / 179

页数：13

共 50 条

[21] Almost Optimal Algorithms for Two-player Zero-Sum Linear Mixture Markov Games
Chen, Zixiang
Zhou, Dongruo
Gu, Quanquan
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
[22] Pure strategy equilibria in symmetric two-player zero-sum games
Peter Duersch
Jörg Oechssler
Burkhard C. Schipper
International Journal of Game Theory, 2012, 41 : 553 - 564
[23] Optimality and Asymptotic Stability in Two-Player Zero-Sum Hybrid Games
Leudo, Santiago J.
Sanfelice, Ricardo G.
HSCC 2022: PROCEEDINGS OF THE 25TH ACM INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (PART OF CPS-IOT WEEK 2022), 2022,
[24] Generating Dominant Strategies for Continuous Two-Player Zero-Sum Games
Vazquez-Chanlatte, Marcell J.
Ghosh, Shromona
Raman, Vasumathi
Sangiovanni-Vincentelli, Alberto
Seshia, Sanjit A.
IFAC PAPERSONLINE, 2018, 51 (16): : 7 - 12
[25] Approximate Dynamic Programming for Two-Player Zero-Sum Markov Games
Perolat, Julien
Scherrer, Bruno
Piot, Bilal
Pietquin, Olivier
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1321 - 1329
[26] When are Offline Two-Player Zero-Sum Markov Games Solvable?
Cui, Qiwen
Du, Simon S.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[27] Two-player zero-sum stochastic differential games with regime switching
Lv, Siyu
AUTOMATICA, 2020, 114
[28] Pure strategy equilibria in symmetric two-player zero-sum games
Duersch, Peter
Oechssler, Joerg
Schipper, Burkhard C.
INTERNATIONAL JOURNAL OF GAME THEORY, 2012, 41 (03) : 553 - 564
[29] Policy Gradient Algorithm in Two-Player Zero-Sum Markov Games
Li Y.
Zhou J.
Feng Y.
Feng Y.
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (01): : 81 - 91
[30] Algorithms for uniform optimal strategies in two-player zero-sum stochastic games with perfect information
Avrachenkov, Konstantin
Cottatellucci, Laura
Maggi, Lorenzo
OPERATIONS RESEARCH LETTERS, 2012, 40 (01) : 56 - 60

← 1 2 3 4 5 →