Markov decision process;
Total cost;
Random horizon;
Varying-time discount factor;
D O I:
10.1007/s10957-012-0262-8
中图分类号:
C93 [管理学];
O22 [运筹学];
学科分类号:
070105 ;
12 ;
1201 ;
1202 ;
120202 ;
摘要:
This paper deals with Markov Decision Processes (MDPs) on Borel spaces with possibly unbounded costs. The criterion to be optimized is the expected total cost with a random horizon of infinite support. In this paper, it is observed that this performance criterion is equivalent to the expected total discounted cost with an infinite horizon and a varying-time discount factor. Then, the optimal value function and the optimal policy are characterized through some suitable versions of the Dynamic Programming Equation. Moreover, it is proved that the optimal value function of the optimal control problem with a random horizon can be bounded from above by the optimal value function of a discounted optimal control problem with a fixed discount factor. In this case, the discount factor is defined in an adequate way by the parameters introduced for the study of the optimal control problem with a random horizon. To illustrate the theory developed, a version of the Linear-Quadratic model with a random horizon and a Logarithm Consumption-Investment model are presented.
机构:
INRIA Team Astral, 200 Ave Vieille Tour, F-33405 Talence, FranceINRIA Team Astral, 200 Ave Vieille Tour, F-33405 Talence, France
Dufour, Francois
Prieto-Rumeau, Tomas
论文数: 0引用数: 0
h-index: 0
机构:
UNED, Fac Sci, Dept Stat Operat Res & Numer Calculus, calle Juan del Rosal 10, Madrid 28040, SpainINRIA Team Astral, 200 Ave Vieille Tour, F-33405 Talence, France
机构:
Univ Paris Saclay, Lab Signals & Syst, CNRS, CentraleSupelec, 3 Rue Joliot Curie, F-91190 Gif Sur Yvette, FranceUniv Paris Saclay, Lab Signals & Syst, CNRS, CentraleSupelec, 3 Rue Joliot Curie, F-91190 Gif Sur Yvette, France
Lisser, Abdel
Singh, Vikas Vikram
论文数: 0引用数: 0
h-index: 0
机构:
Indian Inst Technol Delhi, Dept Math, New Delhi 110016, IndiaUniv Paris Saclay, Lab Signals & Syst, CNRS, CentraleSupelec, 3 Rue Joliot Curie, F-91190 Gif Sur Yvette, France
Singh, Vikas Vikram
APPLIED MATHEMATICS AND OPTIMIZATION,
2024,
90
(01):
机构:
Benemierita Univ Autonoma Puebla, Fac Ciencias Fis Matemat, Ave San Claudio & Rio Verde, Puebla 72570, Puebla, MexicoBenemierita Univ Autonoma Puebla, Fac Ciencias Fis Matemat, Ave San Claudio & Rio Verde, Puebla 72570, Puebla, Mexico
Cruz-Suarez, Hugo
Montes-De-Oca, Raul
论文数: 0引用数: 0
h-index: 0
机构:
Univ Autonoma Metropolitana Iztapalapa, Dept Matemat, Ave Ferrocarril San Rafael Atlixco 186,Col Leyes, Mexico City 09310, MexicoBenemierita Univ Autonoma Puebla, Fac Ciencias Fis Matemat, Ave San Claudio & Rio Verde, Puebla 72570, Puebla, Mexico
Montes-De-Oca, Raul
Ortega-Gutierrez, R. Israel
论文数: 0引用数: 0
h-index: 0
机构:
Benemierita Univ Autonoma Puebla, Fac Ciencias Fis Matemat, Ave San Claudio & Rio Verde, Puebla 72570, Puebla, MexicoBenemierita Univ Autonoma Puebla, Fac Ciencias Fis Matemat, Ave San Claudio & Rio Verde, Puebla 72570, Puebla, Mexico