ARENA-INDEPENDENT FINITE-MEMORY DETERMINACY IN STOCHASTIC GAMES

Cited by: 1

Authors:

Bouyer, Patricia [1]

Oualhadj, Youssouf [2]

Randour, Mickael [3,4]

Vandenhove, Pierre [1,3,4]

Affiliations:

[1] Univ Paris Saclay, CNRS, Lab Methodes Formelles, ENS Paris Saclay, F-91190 Gif Sur Yvette, France

[2] Univ Paris Est Creteil, LACL, F-94010 Creteil, France

[3] FRS FNRS, Brussels, Belgium

[4] UMONS Univ Mons, Mons, Belgium

Source:

LOGICAL METHODS IN COMPUTER SCIENCE | 2023 / Volume 19 / Issue 04

Keywords:

two-player games on graphs; stochastic games; Markov decision processes; finite-memory determinacy; optimal strategies; COMPLEXITY; AUTOMATA;

DOI:

10.46298/LMCS-19(4:18)2023

Chinese Library Classification number:

TP301 [Theory, Methods];

Discipline classification code:

081202 ;

Abstract:

We study stochastic zero-sum games on graphs, which are prevalent tools to model decision-making in the presence of an antagonistic opponent in a random environment. In this setting, an important question is that of strategy complexity: what kinds of strategies are sufficient or required to play optimally (e.g., randomization or memory requirements)? Our contributions further the understanding of arena-independent finite-memory (AIFM) determinacy, i.e., the study of objectives for which memory is needed, but in a way that only depends on limited parameters of the game graphs. First, we show that objectives for which pure AIFM strategies suffice to play optimally also admit pure AIFM subgame perfect strategies. Second, we show that we can reduce the study of objectives for which pure AIFM strategies suffice in two-player stochastic games to the easier study of one-player stochastic games (i.e., Markov decision processes). Third, we characterize the sufficiency of AIFM strategies through two intuitive properties of objectives. This work extends a line of research started on deterministic games to stochastic ones.


Pages: 1 / 18

Page count: 51
