Bounded-Memory Strategies in Partial-Information Games

被引:0
作者
Bose, Sougata [1 ]
Ibsen-Jensen, Rasmus [1 ]
Totzke, Patrick [1 ]
机构
[1] Univ Liverpool, Liverpool, England
来源
PROCEEDINGS OF THE 39TH ANNUAL ACM/IEEE SYMPOSIUM ON LOGIC IN COMPUTER SCIENCE, LICS 2024 | 2024年
基金
英国工程与自然科学研究理事会;
关键词
Finite-memory strategies; Equilibria; Approximation; Imperfect information games;
D O I
10.1145/3661814.3662096
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We study the computational complexity of solving stochastic games with mean-payoff objectives. Instead of identifying special classes in which simple strategies are sufficient to play epsilon-optimally, or form epsilon-Nash equilibria, we consider general partial-information multiplayer games and ask what can be achieved with (and against) finite-memory strategies up to a given bound on the memory. We show NP-hardness for approximating zero-sum values, already with respect to memoryless strategies and for 1-player reachability games. On the other hand, we provide upper bounds for solving games of any fixed number of players k. We show that one can decide in polynomial space if, for a given k-player game, epsilon >= 0 and bound b, there exists an epsilon-Nash equilibrium in which all strategies use at most b memory modes. For given epsilon > 0, finding an epsilon-Nash equilibrium with respect to b-bounded strategies can be done in FNPNP. Similarly for 2-player zero-sum games, finding a b-bounded strategy that, against all b-bounded opponent strategies, guarantees an outcome within.. of a given value, can be done in FNPNP. Our constructions apply to parity objectives with minimal simplifications. Our results improve the status quo in several well-known special cases of games. In particular, for 2-player zero-sum concurrent mean-payoff games, one can approximate ordinary zero-sum values (without restricting admissible strategies) in FNPNP.
引用
收藏
页数:14
相关论文
共 40 条
[1]   On the Encoding and Solving of Partial Information Games [J].
Amoussou-Guenou, Yackolley ;
Baarir, Souheib ;
Potop-Butucaru, Maria ;
Sznajder, Nathalie ;
Tible, Leo ;
Tixeuil, Sebastien .
NETWORKED SYSTEMS, NETYS 2020, 2021, 12129 :60-76
[2]  
Basu S., 2006, Algorithms and Computation in Mathematics, DOI [10.1007/3-540-33099-2, DOI 10.1007/3-540-33099-2]
[3]  
Billingsley P., 1995, PROBABILITY MEASURE, V3rd ed.
[4]   BIG MATCH [J].
BLACKWELL, D ;
FERGUSON, TS .
ANNALS OF MATHEMATICAL STATISTICS, 1968, 39 (01) :159-+
[5]  
Bordais B, 2023, Arxiv, DOI arXiv:2311.14373
[6]  
Bordais Benjamin, 2021, LIPICS, V213, DOI [10.4230/ LIPIcs. FSTTCS. 2021.41, DOI 10.4230/LIPICS.FSTTCS.2021.41]
[7]  
Bordais Benjamin, 2022, LIPICS, V250, DOI [10.4230/LIPIcs.FSTTCS.2022.33, DOI 10.4230/LIPICS.FSTTCS.2022.33]
[8]  
Bose S, 2024, Arxiv, DOI arXiv:2405.09406
[9]  
Bouyer BRV23 Patricia, 2023, TheoretiCS, V2, DOI [DOI 10.46298/THEORETICS.23.1, 10.46298/THEORETICS.23.1]
[10]  
Bouyer P, 2022, LOG METH COMPUT SCI, V18, DOI [10.46298/LMCS-18(1, 10.46298/LMCS-18(1:11)2022]