A Version of the Euler Equation in Discounted Markov Decision Processes

被引:2
|
作者
Cruz-Suarez, H. [1 ]
Zacarias-Espinoza, G. [1 ]
Vazquez-Guevara, V. [1 ]
机构
[1] Benemerita Univ Autonoma Puebla, Fac Ciencias Fis Matemat, CU, Puebla 72570, PUE, Mexico
关键词
UNCERTAINTY; GROWTH;
D O I
10.1155/2012/103698
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This paper deals with Markov decision processes (MDPs) on Euclidean spaces with an infinite horizon. An approach to study this kind of MDPs is using the dynamic programming technique (DP). Then the optimal value function is characterized through the value iteration functions. The paper provides conditions that guarantee the convergence of maximizers of the value iteration functions to the optimal policy. Then, using the Euler equation and an envelope formula, the optimal solution of the optimal control problem is obtained. Finally, this theory is applied to a linear-quadratic control problem in order to find its optimal policy.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Discounted Markov decision processes with fuzzy costs
    Abdellatif Semmouri
    Mostafa Jourhmane
    Zineb Belhallaj
    Annals of Operations Research, 2020, 295 : 769 - 786
  • [2] Weighted discounted Markov decision processes with perturbation
    Liu Ke
    Acta Mathematicae Applicatae Sinica, 1999, 15 (2) : 183 - 189
  • [3] Discounted Markov decision processes with utility constraints
    Kadota, Y
    Kurano, M
    Yasuda, M
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2006, 51 (02) : 279 - 284
  • [4] Discounted Markov decision processes with fuzzy costs
    Semmouri, Abdellatif
    Jourhmane, Mostafa
    Belhallaj, Zineb
    ANNALS OF OPERATIONS RESEARCH, 2020, 295 (02) : 769 - 786
  • [5] Discounted cost Markov decision processes with a constraint
    Wakuta, K
    PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 1998, 12 (02) : 177 - 187
  • [6] THE VARIANCE OF DISCOUNTED MARKOV DECISION-PROCESSES
    SOBEL, MJ
    JOURNAL OF APPLIED PROBABILITY, 1982, 19 (04) : 794 - 802
  • [7] Hierarchical algorithms for discounted and weighted Markov decision processes
    Abbad, M
    Daoui, C
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2003, 58 (02) : 237 - 245
  • [8] Discounted Markov Decision Processes for Small Noise Intensities
    Cruz-Suarez, Hugo
    Ilhuicatzi-Roldan, Rocio
    RECENT ADVANCES IN APPLIED MATHEMATICS, 2009, : 245 - +
  • [9] Constrained discounted semi-Markov decision processes
    Feinberg, EA
    MARKOV PROCESSES AND CONTROLLED MARKOV CHAINS, 2002, : 233 - 244
  • [10] A note on deterministic approximation of discounted Markov decision processes
    Cruz-Suarez, Hugo
    Gordienko, Evgueni
    Montes-de-Oca, Raul
    APPLIED MATHEMATICS LETTERS, 2009, 22 (08) : 1252 - 1256