AN UNBOUNDED BERGE'S MINIMUM THEOREM WITH APPLICATIONS TO DISCOUNTED MARKOV DECISION PROCESSES

被引:0
|
作者
Montes-de-Oca, Raul [1 ]
Lemus-Rodriguez, Enrique [2 ]
机构
[1] Univ Autonoma Metropolitana Iztapalapa, Dept Matemat, Mexico City 09340, DF, Mexico
[2] Univ Anahuac Mexico Norte, Escuela Actuaria, Huixquilucan 52786, Edo De Mexico, Mexico
关键词
Berge's minimum theorem; moment function; discounted Markov decision process; uniqueness of the optimal policy; continuous optimal policy; MAXIMUM THEOREMS; OPTIMAL POLICIES; CONTINUITY; GROWTH;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper deals with a certain class of unbounded optimization problems. The optimization problems taken into account depend on a parameter. Firstly, there are established conditions which permit to guarantee the continuity with respect to the parameter of the minimum of the optimization problems under consideration, and the upper semicontinuity of the multifunction which applies each parameter into its set of minimizers. Besides, with the additional condition of uniqueness of the minimizer, its continuity is given. Some examples of nonconvex optimization problems that satisfy the conditions of the article are supplied. Secondly, the theory developed is applied to discounted Markov decision processes with unbounded cost functions and with possibly noncompact actions sets in order to obtain continuous optimal policies. This part of the paper is illustrated with two examples of the controlled Lindley's random walk. One of these examples has nonconstant action sets.
引用
收藏
页码:268 / 286
页数:19
相关论文
共 8 条
  • [1] An envelope theorem and some applications to discounted Markov decision processes
    Hugo Cruz-Suárez
    Raúl Montes-de-Oca
    Mathematical Methods of Operations Research, 2008, 67 : 299 - 321
  • [2] An envelope theorem and some applications to discounted Markov decision processes
    Cruz-Suarez, Hugo
    Montes-de-Oca, Raul
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2008, 67 (02) : 299 - 321
  • [3] A Version of the Euler Equation in Discounted Markov Decision Processes
    Cruz-Suarez, H.
    Zacarias-Espinoza, G.
    Vazquez-Guevara, V.
    JOURNAL OF APPLIED MATHEMATICS, 2012,
  • [4] Uniform convergence of value iteration policies for discounted Markov decision processes
    Cruz-Suarez, Daniel
    Montes-De-Oca, Raul
    BOLETIN DE LA SOCIEDAD MATEMATICA MEXICANA, 2006, 12 (01): : 133 - 148
  • [5] Constrained Markov decision processes in Borel spaces: from discounted to average optimality
    Mendoza-Perez, Armando F.
    Jasso-Fuentes, Hector
    De-la-Cruz Courtois, Omar A.
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2016, 84 (03) : 489 - 525
  • [6] Optimality of Quasi-Open-Loop Policies for Discounted Semi-Markov Decision Processes
    Adelman, Daniel
    Mancini, Angelo J.
    MATHEMATICS OF OPERATIONS RESEARCH, 2016, 41 (04) : 1222 - 1247
  • [7] Continuous-Time Markov Decision Processes Under the Risk-Sensitive First Passage Discounted Cost Criterion
    Wei, Qingda
    Chen, Xian
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2023, 197 (01) : 309 - 333
  • [8] Continuous-Time Markov Decision Processes Under the Risk-Sensitive First Passage Discounted Cost Criterion
    Qingda Wei
    Xian Chen
    Journal of Optimization Theory and Applications, 2023, 197 : 309 - 333