DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED RATES: THE CONVEX ANALYTIC APPROACH

Cited by: 48
Authors
Piunovskiy, Alexey [1]
Zhang, Yi [1]
Affiliations
[1] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
Keywords
Borel space; constrained continuous-time Markov decision process; convex analytic approach; duality; history-dependent policies; unbounded rates; MODEL;
DOI
10.1137/10081366X
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper deals with constrained discounted continuous-time Markov decision processes, also known as controlled jump Markov processes, with Borel state and action spaces. Under some conditions imposed on the primitives, allowing unbounded transition rates and unbounded (from both above and below) cost rates, first, we study the space of occupation measures. Then we reformulate the original problem as a linear program over the space of those measures and undertake the duality analysis. Finally, under some compactness-continuity conditions, we show the existence of a stationary optimal policy out of the class of randomized history-dependent policies.
Pages: 2032-2061
Page count: 30