Markov Decision Processes on Borel Spaces with Total Cost and Random Horizon

被引:6
|
作者
Cruz-Suarez, Hugo [1 ]
Ilhuicatzi-Roldan, Rocio [1 ]
Montes-de-Oca, Raul [2 ]
机构
[1] Benemerita Univ Autonoma Puebla, Fac Ciencias Fis Matemat, Puebla, Mexico
[2] Univ Autonoma Metropolitana Iztapalapa, Dept Matemat, Mexico City 09340, DF, Mexico
关键词
Markov decision process; Total cost; Random horizon; Varying-time discount factor;
D O I
10.1007/s10957-012-0262-8
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
This paper deals with Markov Decision Processes (MDPs) on Borel spaces with possibly unbounded costs. The criterion to be optimized is the expected total cost with a random horizon of infinite support. In this paper, it is observed that this performance criterion is equivalent to the expected total discounted cost with an infinite horizon and a varying-time discount factor. Then, the optimal value function and the optimal policy are characterized through some suitable versions of the Dynamic Programming Equation. Moreover, it is proved that the optimal value function of the optimal control problem with a random horizon can be bounded from above by the optimal value function of a discounted optimal control problem with a fixed discount factor. In this case, the discount factor is defined in an adequate way by the parameters introduced for the study of the optimal control problem with a random horizon. To illustrate the theory developed, a version of the Linear-Quadratic model with a random horizon and a Logarithm Consumption-Investment model are presented.
引用
收藏
页码:329 / 346
页数:18
相关论文
共 50 条
  • [21] MAXIMIZING THE PROBABILITY OF VISITING A SET INFINITELY OFTEN FOR A MARKOV DECISION PROCESS WITH BOREL STATE AND ACTION SPACES
    Dufour, Francois
    Prieto-Rumeau, Tomas
    JOURNAL OF APPLIED PROBABILITY, 2024, 61 (04) : 1424 - 1447
  • [22] Threshold probability of non-terminal type in finite horizon Markov decision processes
    Kira, Akifumi
    Ueno, Takayuki
    Fujita, Toshiharu
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2012, 386 (01) : 461 - 472
  • [23] Markov Decision Processes
    Bäuerle N.
    Rieder U.
    Jahresbericht der Deutschen Mathematiker-Vereinigung, 2010, 112 (4) : 217 - 243
  • [24] Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities
    Feinberg, Eugene A.
    Kasyanov, Pavlo O.
    Zadoianchuk, Nina V.
    MATHEMATICS OF OPERATIONS RESEARCH, 2012, 37 (04) : 591 - 607
  • [25] A note on the structure of value spaces in vector-valued Markov decision processes
    Wakuta, K
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 1999, 49 (01) : 77 - 85
  • [26] Distributionally Robust Chance-Constrained Markov Decision Processes with Random Payoff
    Nguyen, Hoang Nam
    Lisser, Abdel
    Singh, Vikas Vikram
    APPLIED MATHEMATICS AND OPTIMIZATION, 2024, 90 (01):
  • [27] Online Markov Decision Processes
    Even-Dar, Eyal
    Kakade, Sham M.
    Mansour, Yishay
    MATHEMATICS OF OPERATIONS RESEARCH, 2009, 34 (03) : 726 - 736
  • [28] Quantile Markov Decision Processes
    Li, Xiaocheng
    Zhong, Huaiyang
    Brandeau, Margaret L.
    OPERATIONS RESEARCH, 2021, 70 (03) : 1428 - 1447
  • [29] AN EXTENDED VERSION OF AVERAGE MARKOV DECISION PROCESSES ON DISCRETE SPACES UNDER FUZZY ENVIRONMENT
    Cruz-Suarez, Hugo
    Montes-De-Oca, Raul
    Ortega-Gutierrez, R. Israel
    KYBERNETIKA, 2023, 59 (01) : 160 - 178
  • [30] On existence of Berk-Nash equilibria in misspecified Markov decision processes with infinite spaces
    Anderson, Robert M.
    Duanmu, Haosui
    Ghosh, Aniruddha
    Khan, M. Ali
    JOURNAL OF ECONOMIC THEORY, 2024, 217