AN UNBOUNDED BERGE'S MINIMUM THEOREM WITH APPLICATIONS TO DISCOUNTED MARKOV DECISION PROCESSES

Cited by: 0

Authors:

Montes-de-Oca, Raul [1]

Lemus-Rodriguez, Enrique [2]

Affiliations:

[1] Univ Autonoma Metropolitana Iztapalapa, Dept Matemat, Mexico City 09340, DF, Mexico

[2] Univ Anahuac Mexico Norte, Escuela Actuaria, Huixquilucan 52786, Edo De Mexico, Mexico

Source:

KYBERNETIKA | 2012 / Volume 48 / Issue 02

Keywords:

Berge's minimum theorem; moment function; discounted Markov decision process; uniqueness of the optimal policy; continuous optimal policy; MAXIMUM THEOREMS; OPTIMAL POLICIES; CONTINUITY; GROWTH;

DOI:

Not available

Chinese Library Classification number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper deals with a class of unbounded optimization problems that depend on a parameter. First, conditions are established that guarantee the continuity, with respect to the parameter, of the minimum of the optimization problems under consideration, as well as the upper semicontinuity of the multifunction that maps each parameter to its set of minimizers. Moreover, under the additional condition that the minimizer is unique, its continuity is obtained. Some examples of nonconvex optimization problems that satisfy the conditions of the article are supplied. Second, the theory developed is applied to discounted Markov decision processes with unbounded cost functions and possibly noncompact action sets in order to obtain continuous optimal policies. This part of the paper is illustrated with two examples of the controlled Lindley's random walk, one of which has nonconstant action sets.
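
As a rough sketch of the setting the abstract describes, the parametric problem can be written as follows. The symbols $\Theta$, $X$, $A(\theta)$, $f$, $v$, $A^{*}$ and $x^{*}$ are illustrative notation assumed here and are not taken from the paper; the precise conditions are those stated in the article itself.

```latex
% Illustrative notation only (assumed, not the paper's):
% \theta \in \Theta is the parameter, A(\theta) \subseteq X the feasible set,
% and f the (possibly unbounded) objective function.
\[
  v(\theta) = \inf_{x \in A(\theta)} f(\theta, x),
  \qquad
  A^{*}(\theta) = \{\, x \in A(\theta) : f(\theta, x) = v(\theta) \,\}.
\]
% Conclusions summarized in the abstract, valid under the paper's conditions:
%   (i)   \theta \mapsto v(\theta) is continuous;
%   (ii)  the multifunction \theta \mapsto A^{*}(\theta) is upper semicontinuous;
%   (iii) if A^{*}(\theta) = \{ x^{*}(\theta) \} for every \theta,
%         then \theta \mapsto x^{*}(\theta) is continuous.
```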


Pages: 268-286

Number of pages: 19

