The Transformation Method for Continuous-Time Markov Decision Processes

Cited by: 13

Authors:

Piunovskiy, Alexey [1]

Zhang, Yi [1]

Affiliations:

[1] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England

Source:

JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS | 2012 / Vol. 154 / Issue 02

Keywords:

Discrete-time Markov decision process; Continuous-time Markov decision process; Unbounded transition rates; Transformation method; History-dependent policies; RATES;

DOI:

10.1007/s10957-012-0015-8

Chinese Library Classification (CLC) number:

C93 [Management Science]; O22 [Operations Research];

Discipline classification codes:

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

Abstract:

In this paper, we show that a discounted continuous-time Markov decision process in Borel spaces with randomized history-dependent policies, arbitrarily unbounded transition rates and a non-negative reward rate is equivalent to a discrete-time Markov decision process. Based on a completely new proof, which does not involve Kolmogorov's forward equation, it is shown that the value function for both models is given by the minimal non-negative solution to the same Bellman equation. A verifiable necessary and sufficient condition for the finiteness of this value function is given, which induces a new condition for the non-explosion of the underlying controlled process.


Pages: 691 - 712

Number of pages: 22

50 records in total

[1] The Transformation Method for Continuous-Time Markov Decision Processes
Alexey Piunovskiy
Yi Zhang
Journal of Optimization Theory and Applications, 2012, 154 : 691 - 712
[2] Impulsive control for continuous-time Markov decision processes
Université Bordeaux, IMB, INRIA Bordeaux Sud-Ouest, 200 Avenue de la Vieille Tour, Talence Cedex 33405, France
[Unknown], L69 7ZL, United Kingdom
Adv Appl Probab, 1 (106-127):
[3] IMPULSIVE CONTROL FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES
Dufour, Francois
Piunovskiy, Alexei B.
ADVANCES IN APPLIED PROBABILITY, 2015, 47 (01) : 106 - 127
[4] REALIZABLE STRATEGIES IN CONTINUOUS-TIME MARKOV DECISION PROCESSES
Piunovskiy, Alexey
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2018, 56 (01) : 473 - 495
[5] Continuous-Time Markov Decision Processes with Controlled Observations
Huang, Yunhan
Kavitha, Veeraruna
Zhu, Quanyan
2019 57TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2019, : 32 - 39
[6] Continuous-Time Markov Decision Processes with Exponential Utility
Zhang, Yi
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2017, 55 (04) : 2636 - 2660
[7] Delayed Nondeterminism in Continuous-Time Markov Decision Processes
Neuhaeusser, Martin R.
Stoelinga, Marielle
Katoen, Joost-Pieter
FOUNDATIONS OF SOFTWARE SCIENCE AND COMPUTATIONAL STRUCTURES, PROCEEDINGS, 2009, 5504 : 364 - +
[8] Sufficiency of Markov Policies for Continuous-Time Jump Markov Decision Processes
Feinberg, Eugene A.
Mandava, Manasa
Shiryaev, Albert N.
MATHEMATICS OF OPERATIONS RESEARCH, 2022, 47 (02) : 1266 - 1286
[9] Constrained continuous-time Markov decision processes with average criteria
Lanlan Zhang
Xianping Guo
Mathematical Methods of Operations Research, 2008, 67 : 323 - 340
[10] Constrained continuous-time Markov decision processes with average criteria
Zhang, Lanlan
Guo, Xianping
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2008, 67 (02) : 323 - 340
