Mean Field Markov Decision Processes

被引：4

作者：

Baeuerle, Nicole ^{[1
]}

机构：

[1] Karlsruhe Inst Technol KIT, Dept Math, D-76128 Karlsruhe, Germany

来源：

APPLIED MATHEMATICS AND OPTIMIZATION | 2023年 / 88卷 / 01期

关键词：

Mean-field control; Markov decision process; Average reward; INTERACTING OBJECTS; AVERAGE OPTIMALITY; DISCRETE; POLICIES; SYSTEMS; CHAINS; GAMES;

D O I：

10.1007/s00245-023-09985-1

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

We consider mean-field control problems in discrete time with discounted reward, infinite time horizon and compact state and action space. The existence of optimal policies is shown and the limiting mean-field problem is derived when the number of individuals tends to infinity. Moreover, we consider the average reward problem and show that the optimal policy in this mean-field limit is e-optimal for the discounted problem if the number of individuals is large and the discount factor close to one. This result is very helpful, because it turns out that in the special case when the reward does only depend on the distribution of the individuals, we obtain a very interesting subclass of problems where an average reward optimal policy can be obtained by first computing an optimal measure from a static optimization problem and then achieving it with Markov Chain Monte Carlo methods. We give two applications: Avoiding congestion an a graph and optimal positioning on a market place which we solve explicitly.

引用

页数：36

共 50 条

[1] Mean Field Markov Decision Processes
Nicole Bäuerle
Applied Mathematics & Optimization, 2023, 88
[2] MEAN-FIELD MARKOV DECISION PROCESSES WITH COMMON NOISE AND OPEN-LOOP CONTROLS
Motte, Mederic
Huyen Pham
ANNALS OF APPLIED PROBABILITY, 2022, 32 (02): : 1421 - 1458
[3] Optimizing the Expected Mean Payoff in Energy Markov Decision Processes
Brazdil, Tomas
Kucera, Antonin
Novotny, Petr
AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS, ATVA 2016, 2016, 9938 : 32 - 49
[4] Energy and Mean-Payoff Parity Markov Decision Processes
Chatterjee, Krishnendu
Doyen, Laurent
MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE 2011, 2011, 6907 : 206 - 218
[5] Reversible Markov decision processes and the Gaussian free field
Anantharam, Venkat
SYSTEMS & CONTROL LETTERS, 2022, 169
[6] Global Algorithms for Mean-Variance Optimization in Markov Decision Processes
Xia, Li
Ma, Shuai
MATHEMATICS OF OPERATIONS RESEARCH, 2025,
[7] Equilibrium in misspecified Markov decision processes
Esponda, Ignacio
Pouzo, Demian
THEORETICAL ECONOMICS, 2021, 16 (02) : 717 - 757
[8] Markov Decision Processes
Bäuerle N.
Rieder U.
Jahresbericht der Deutschen Mathematiker-Vereinigung, 2010, 112 (4) : 217 - 243
[9] Risk-Sensitive Markov Decision Processes with Combined Metrics of Mean and Variance
Xia, Li
PRODUCTION AND OPERATIONS MANAGEMENT, 2020, 29 (12) : 2808 - 2827
[10] Mean-variance optimization of discrete time discounted Markov decision processes
Xia, Li
AUTOMATICA, 2018, 88 : 76 - 82

← 1 2 3 4 5 →