Constrained continuous-time Markov decision processes with average criteria

被引:0
|
作者
Lanlan Zhang
Xianping Guo
机构
[1] Zhongshan University,The School of Mathematics and Computational Science
关键词
Continuous-time Markov decision process; Unbounded reward/cost and transition rates; Average criteria; Constrained-optimal policy; 90C40; 93E20;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we study constrained continuous-time Markov decision processes with a denumerable state space and unbounded reward/cost and transition rates. The criterion to be maximized is the expected average reward, and a constraint is imposed on an expected average cost. We give suitable conditions that ensure the existence of a constrained-optimal policy. Moreover, we show that the constrained-optimal policy randomizes between two stationary policies differing in at most one state. Finally, we use a controlled queueing system to illustrate our conditions.
引用
收藏
页码:323 / 340
页数:17
相关论文
共 50 条