Value set iteration for two-person zero-sum Markov games

Cited by: 1

Authors:

Chang, Hyeong Soo [1]

Affiliations:

[1] Sogang Univ, Dept Comp Sci & Engn, Seoul, South Korea

Source:

AUTOMATICA | 2017 / Vol. 76

Keywords:

Two-person zero-sum Markov game; Value iteration; Policy iteration; Stochastic game

DOI:

10.1016/j.automatica.2016.10.010

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

We present a novel exact algorithm called "value set iteration" (VSI) for solving two-person zero-sum Markov games (MGs) as a generalization of value iteration (VI) and as a general framework of combining multiple solution methods. We introduce a novel operator in the value function space and iteratively apply the operator with any sequence of the set of policies, extending Chang's VSI for MDPs into the MG setting. We show that VSI for MGs converges to the equilibrium value function with at least linear convergence rate and establish that VSI can potentially improve the convergence speed in terms of the number of iterations by proper setting of the sequence of the set of policies. (C) 2016 Elsevier Ltd. All rights reserved.
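
The abstract describes VSI as a generalization of value iteration (VI). For orientation, the following is a minimal sketch of the classical VI baseline for a zero-sum Markov game, not of the VSI operator itself, which the abstract does not define. It assumes a discounted criterion with factor `gamma`, a finite state set, and exactly two actions per player, so that each stage matrix game has a closed-form value. The names `matrix_game_value_2x2`, `value_iteration`, `reward`, and `trans` are hypothetical and chosen only for this illustration.

```python
def matrix_game_value_2x2(m):
    """Value of a 2x2 zero-sum matrix game; the row player maximizes."""
    (a, b), (c, d) = m
    maximin = max(min(a, b), min(c, d))  # best guaranteed payoff for rows
    minimax = min(max(a, c), max(b, d))  # best guaranteed cap for columns
    if maximin == minimax:
        # A pure-strategy saddle point exists.
        return maximin
    # No saddle point: both players mix, and the classical closed form applies.
    return (a * d - b * c) / (a + d - b - c)


def value_iteration(reward, trans, gamma, tol=1e-10, max_iter=10_000):
    """Classical VI for a discounted two-person zero-sum Markov game.

    reward[s][a][b]   : stage payoff to the maximizer (assumed layout)
    trans[s][a][b][t] : probability of moving from state s to state t
    """
    n = len(reward)
    v = [0.0] * n
    for _ in range(max_iter):
        new_v = []
        for s in range(n):
            # Stage game at s: immediate payoff plus discounted continuation.
            q = [[reward[s][a][b]
                  + gamma * sum(p * v[t] for t, p in enumerate(trans[s][a][b]))
                  for b in range(2)]
                 for a in range(2)]
            new_v.append(matrix_game_value_2x2(q))
        delta = max(abs(x - y) for x, y in zip(new_v, v))
        v = new_v
        if delta < tol:
            break
    return v


# Single-state example: every action pair leads back to state 0.
stay = [[[1.0], [1.0]], [[1.0], [1.0]]]
pennies = [[[1.0, -1.0], [-1.0, 1.0]]]
print(value_iteration(pennies, [stay], gamma=0.9))
```

With more than two actions per player, the stage game generally has no closed form and is usually solved by linear programming instead. Each sweep of this baseline contracts the error by a factor of `gamma`, which is the linear rate that the abstract says VSI at least matches.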

Pages: 61-64

Page count: 4

50 records in total

[1] Two-person zero-sum Markov games: Receding horizon approach
Chang, HS
Marcus, SI
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (11) : 1951 - 1961
[2] The Design of ϵ-Optimal Strategy for Two-Person Zero-Sum Markov Games
Xie, Kaiyun
Xiong, Junlin
IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 2349 - 2354
[3] TWO-PERSON ZERO-SUM STOCHASTIC GAMES
Baykal-Guersoy, Melike
ANNALS OF OPERATIONS RESEARCH, 1991, 28 (01) : 135 - 152
[4] A perturbation on two-person zero-sum games
Kimura, Y
Sawasaki, Y
Tanaka, K
ADVANCES IN DYNAMIC GAMES AND APPLICATIONS, 2000, 5 : 279 - 288
[5] On the value of two-person zero-sum linear quadratic differential games
Zhang, Pingjian
Deng, Huifang
Xi, Jianqing
2005 44th IEEE Conference on Decision and Control & European Control Conference, Vols 1-8, 2005 : 714 - 718
[6] On the solution of two-person zero-sum matrix games
Stefanov, Stefan M.
JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2024, 45 (03) : 649 - 657
[7] 'TWO-PERSON/ZERO-SUM'
NEMEROV, H
KENYON REVIEW, 1986, 8 (03) : 74 - 74
[8] ON ZERO-SUM TWO-PERSON UNDISCOUNTED SEMI-MARKOV GAMES WITH A MULTICHAIN STRUCTURE
Mondal, Prasenjit
ADVANCES IN APPLIED PROBABILITY, 2017, 49 (03) : 826 - 849
[9] Linear Programming and Zero-Sum Two-Person Undiscounted Semi-Markov Games
Mondal, Prasenjit
ASIA-PACIFIC JOURNAL OF OPERATIONAL RESEARCH, 2015, 32 (06)
[10] Perfect information two-person zero-sum markov games with imprecise transition probabilities
Chang, Hyeong Soo
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2006, 64 (02) : 335 - 351
