Risk-sensitive discounted Markov decision processes with unbounded reward functions and Borel spaces

被引:1
作者
Guo, Xin [1 ]
机构
[1] Sun Yat Sen Univ, Sch Sci, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Markov decision process; unbounded reward functions; risk-sensitive discounted optimality; optimality equation; optimal policy; DISCRETE-TIME;
D O I
10.1080/17442508.2024.2314462
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This paper attempts to study the risk-sensitive discounted discrete-time Markov decision processes in Borel spaces, in which the reward functions are allowed to be unbounded from above and from below. We find mild conditions imposed on the primitive data of the decision processes, which not only ensure the existence of a solution to the optimality equation (OE in short), but also are the generalization of the bounded reward case. Furthermore, using the OE and a novel technique, we prove the existence of an optimal policy out of the class of randomized history-dependent policies. Finally, we illustrate our results with an inventory system.
引用
收藏
页码:649 / 666
页数:18
相关论文
共 17 条
[1]   Partially Observable Risk-Sensitive Markov Decision Processes [J].
Baeuerle, Nicole ;
Rieder, Ulrich .
MATHEMATICS OF OPERATIONS RESEARCH, 2017, 42 (04) :1180-1196
[2]   Risk-sensitive dividend problems [J].
Baeuerle, Nicole ;
Jaskiewicz, Anna .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2015, 242 (01) :161-171
[3]   More Risk-Sensitive Markov Decision Processes [J].
Baeuerle, Nicole ;
Rieder, Ulrich .
MATHEMATICS OF OPERATIONS RESEARCH, 2014, 39 (01) :105-120
[4]   Risk sensitive control of finite state Markov chains in discrete time, with applications to portfolio management [J].
Bielecki, T ;
Hernández-Hernández, D ;
Pliska, SR .
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 1999, 50 (02) :167-188
[5]   Risk-sensitive optimal control for Markov decision processes with monotone cost [J].
Borkar, VS ;
Meyn, SP .
MATHEMATICS OF OPERATIONS RESEARCH, 2002, 27 (01) :192-209
[6]   Risk-sensitive control of discrete-time Markov processes with infinite horizon [J].
Di Masi, GB ;
Stettner, L .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1999, 38 (01) :61-78
[7]   Infinite horizon risk sensitive control of discrete time Markov processes under minorization property [J].
Di Masi, Giovanni B. ;
Stettner, Lukasz .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2007, 46 (01) :231-252
[8]   Risk-sensitive control of continuous time Markov chains [J].
Ghosh, Mrinal K. ;
Saha, Subhamay .
STOCHASTICS-AN INTERNATIONAL JOURNAL OF PROBABILITY AND STOCHASTIC PROCESSES, 2014, 86 (04) :655-675
[9]   RISK-SENSITIVE DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED RATES [J].
Guo, Xianping ;
Liao, Zhong-Wei .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2019, 57 (06) :3857-3883
[10]   Finite horizon risk-sensitive continuous-time Markov decision processes with unbounded transition and cost rates [J].
Guo, Xin ;
Liu, Qiuli ;
Zhang, Yi .
4OR-A QUARTERLY JOURNAL OF OPERATIONS RESEARCH, 2019, 17 (04) :427-442