Value set iteration for two-person zero-sum Markov games

被引：1

作者：

Chang, Hyeong Soo ^{[1
]}

机构：

[1] Sogang Univ, Dept Comp Sci & Engn, Seoul, South Korea

来源：

AUTOMATICA | 2017年 / 76卷

关键词：

Two-person zero-sum Markov game; Value iteration; Policy iteration; Stochastic game;

D O I：

10.1016/j.automatica.2016.10.010

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present a novel exact algorithm called "value set iteration" (VSI) for solving two-person zero-sum Markov games (MGs) as a generalization of value iteration (VI) and as a general framework of combining multiple solution methods. We introduce a novel operator in the value function space and iteratively apply the operator with any sequence of the set of policies, extending Chang's VSI for MDPs into the MG setting. We show that VSI for MGs converges to the equilibrium value function with at least linear convergence rate and establish that VSI can potentially improve the convergence speed in terms of the number of iterations by proper setting of the sequence of the set of policies. (C) 2016 Elsevier Ltd. All rights reserved.

引用

页码：61 / 64

页数：4

共 42 条

[21] Limit Optimal Trajectories in Zero-Sum Stochastic Games
Sorin, Sylvain
Vigeral, Guillaume
DYNAMIC GAMES AND APPLICATIONS, 2020, 10 (02) : 555 - 572
[22] Zero-sum constrained stochastic games with independent state processes
Altman, E
Avrachenkov, K
Marquez, R
Miller, G
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2005, 62 (03) : 375 - 386
[23] Strategy synthesis for zero-sum neuro-symbolic concurrent stochastic games
Yan, Rui
Santos, Gabriel
Norman, Gethin
Parker, David
Kwiatkowska, Marta
INFORMATION AND COMPUTATION, 2024, 300
[24] Zero-sum constrained stochastic games with independent state processes
Eitan Altman
Konstantin Avrachenkov
Richard Marquez
Gregory Miller
Mathematical Methods of Operations Research, 2005, 62 : 375 - 386
[25] Value set iteration for Markov decision processes
Chang, Hyeong Soo
AUTOMATICA, 2014, 50 (07) : 1940 - 1943
[26] GPI-Based design for partially unknown nonlinear two-player zero-sum games
Yu, Lin
Xiong, Junlin
Xie, Min
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (03): : 2068 - 2088
[27] Min-max adaptive dynamic programming for zero-sum differential games
Sarbaz, Mohammad
Sun, Wei
INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (12) : 2886 - 2895
[28] Solving zero-sum one-sided partially observable stochastic games
Horak, Karel
Bosansky, Branislav
Kovarik, Vojtech
Kiekintveld, Christopher
ARTIFICIAL INTELLIGENCE, 2023, 316
[29] Adaptive policy for two finite Markov chains zero-sum stochastic game with unknown transition matrices and average payoffs
Najim, K
Poznyak, AS
Gomez, E
AUTOMATICA, 2001, 37 (07) : 1007 - 1018
[30] Generic uniqueness of the bias vector of finite zero-sum stochastic games with perfect information
Akian, Marianne
Gaubert, Stephane
Hochart, Antoine
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2018, 457 (02) : 1038 - 1064

← 1 2 3 4 5 →