ARENA-INDEPENDENT FINITE-MEMORY DETERMINACY IN STOCHASTIC GAMES

被引:1
作者
Bouyer, Patricia [1 ]
Oualhadj, Youssouf [2 ]
Randour, Mickael [3 ,4 ]
Vandenhove, Pierre [1 ,3 ,4 ]
机构
[1] Univ Paris Saclay, CNRS, Lab Methodes Formelles, ENS Paris Saclay, F-91190 Gif Sur Yvette, France
[2] Univ Paris Est Creteil, LACL, F-94010 Creteil, France
[3] FRS FNRS, Brussels, Belgium
[4] UMONS Univ Mons, Mons, Belgium
关键词
two-player games on graphs; stochastic games; Markov decision processes; finite-memory determinacy; optimal strategies; COMPLEXITY; AUTOMATA;
D O I
10.46298/LMCS-19(4:18)2023
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We study stochastic zero-sum games on graphs, which are prevalent tools to model decision-making in presence of an antagonistic opponent in a random environment. In this setting, an important question is the one of strategy complexity: what kinds of strategies are sufficient or required to play optimally (e.g., randomization or memory requirements)? Our contributions further the understanding of arena-independent finite-memory (AIFM) determinacy, i.e., the study of objectives for which memory is needed, but in a way that only depends on limited parameters of the game graphs. First, we show that objectives for which pure AIFM strategies suffice to play optimally also admit pure AIFM subgame perfect strategies. Second, we show that we can reduce the study of objectives for which pure AIFM strategies suffice in two-player stochastic games to the easier study of one-player stochastic games (i.e., Markov decision processes). Third, we characterize the sufficiency of AIFM strategies through two intuitive properties of objectives. This work extends a line of research started on deterministic games to stochastic ones.
引用
收藏
页码:1 / 18
页数:51
相关论文
共 58 条
  • [11] Bouyer Patricia, LIPIcs, V203
  • [12] One-Counter Stochastic Games
    Brazdil, Tomas
    Brozek, Vaclav
    Etessami, Kousha
    [J]. IARCS ANNUAL CONFERENCE ON FOUNDATIONS OF SOFTWARE TECHNOLOGY AND THEORETICAL COMPUTER SCIENCE (FSTTCS 2010), 2010, 8 : 108 - 119
  • [13] LIFE IS RANDOM, TIME IS NOT: MARKOV DECISION PROCESSES WITH WINDOW OBJECTIVES
    Brihaye, Thomas
    Delgrange, Florent
    Randour, Mickael
    Oualhadj, Youssouf
    [J]. LOGICAL METHODS IN COMPUTER SCIENCE, 2020, 16 (04) : 1 - 13
  • [14] Meet your expectations with guarantees: Beyond worst-case synthesis in quantitative games
    Bruyere, Veronique
    Filiot, Emmanuel
    Randour, Mickael
    Raskin, Jean-Francois
    [J]. INFORMATION AND COMPUTATION, 2017, 254 : 259 - 295
  • [15] Window Parity Games: An Alternative Approach Toward Parity Games with Time Bounds
    Bruyere, Veronique
    Hautem, Quentin
    Randour, Mickael
    [J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2016, (226): : 135 - 148
  • [16] Bruyere Veronique, 2019, LIPIcs, V140, DOI DOI 10.4230/LIPICS
  • [17] Chatterjee K, 2004, INT CONF QUANT EVAL, P206
  • [18] Chatterjee K, 2007, LECT NOTES COMPUT SC, V4423, P153
  • [19] Stochastic Games with Lexicographic Reachability-Safety Objectives
    Chatterjee, Krishnendu
    Katoen, Joost-Pieter
    Weininger, Maximilian
    Winkler, Tobias
    [J]. COMPUTER AIDED VERIFICATION, PT II, 2020, 12225 : 398 - 420
  • [20] UNIFYING TWO VIEWS ON MULTIPLE MEAN-PAYOFF OBJECTIVES IN MARKOV DECISION PROCESSES
    Chatterjee, Krishnendu
    Kretinska, Zuzana
    Kretinsky, Jan
    [J]. LOGICAL METHODS IN COMPUTER SCIENCE, 2017, 13 (02)