DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED RATES: THE CONVEX ANALYTIC APPROACH

被引：48

作者：

Piunovskiy, Alexey ^{[1
]}

Zhang, Yi ^{[1
]}

机构：

[1] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England

来源：

SIAM JOURNAL ON CONTROL AND OPTIMIZATION | 2011年 / 49卷 / 05期

基金：

英国工程与自然科学研究理事会;

关键词：

Borel space; constrained continuous-time Markov decision process; convex analytic approach; duality; history-dependent policies; unbounded rates; MODEL;

D O I：

10.1137/10081366X

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper deals with constrained discounted continuous-time Markov decision processes, also known as controlled jump Markov processes, with Borel state and action spaces. Under some conditions imposed on the primitives, allowing unbounded transition rates and unbounded (from both above and below) cost rates, first, we study the space of occupation measures. Then we reformulate the original problem as a linear program over the space of those measures and undertake the duality analysis. Finally, under some compactness-continuity conditions, we show the existence of a stationary optimal policy out of the class of randomized history-dependent policies.

引用

页码：2032 / 2061

页数：30

共 50 条

[1] Discounted Continuous-Time Markov Decision Processes with Constraints: Unbounded Transition and Loss Rates
Guo, Xianping
Piunovskiy, Alexei
MATHEMATICS OF OPERATIONS RESEARCH, 2011, 36 (01) : 105 - 132
[2] Discounted continuous-time Markov decision processes with unbounded rates and randomized history-dependent policies: the dynamic programming approach
Piunovskiy, Alexey
Zhang, Yi
4OR-A QUARTERLY JOURNAL OF OPERATIONS RESEARCH, 2014, 12 (01): : 49 - 75
[3] Convex analytic approach to constrained discounted Markov decision processes with non-constant discount factors
Yi Zhang
TOP, 2013, 21 : 378 - 408
[4] Convex analytic approach to constrained discounted Markov decision processes with non-constant discount factors
Zhang, Yi
TOP, 2013, 21 (02) : 378 - 408
[5] The Transformation Method for Continuous-Time Markov Decision Processes
Piunovskiy, Alexey
Zhang, Yi
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 154 (02) : 691 - 712
[6] The Transformation Method for Continuous-Time Markov Decision Processes
Alexey Piunovskiy
Yi Zhang
Journal of Optimization Theory and Applications, 2012, 154 : 691 - 712
[7] A CONVEX ANALYTIC APPROACH TO RISK-AWARE MARKOV DECISION PROCESSES
Haskell, William B.
Jain, Rahul
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2015, 53 (03) : 1569 - 1598
[8] Constrained total undiscounted continuous-time Markov decision processes
Guo, Xianping
Zhang, Yi
BERNOULLI, 2017, 23 (03) : 1694 - 1736
[9] Finite horizon continuous-time Markov decision processes with mean and variance criteria
Huang, Yonghui
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2018, 28 (04): : 539 - 564
[10] MULTIOBJECTIVE STOPPING PROBLEM FOR DISCRETE-TIME MARKOV PROCESSES: CONVEX ANALYTIC APPROACH
Dufour, F.
Piunovskiy, A. B.
JOURNAL OF APPLIED PROBABILITY, 2010, 47 (04) : 947 - 966

← 1 2 3 4 5 →