The Transformation Method for Continuous-Time Markov Decision Processes

被引:13
|
作者
Piunovskiy, Alexey [1 ]
Zhang, Yi [1 ]
机构
[1] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England
关键词
Discrete-time Markov decision process; Continuous-time Markov decision process; Unbounded transition rates; Transformation method; History-dependent policies; RATES;
D O I
10.1007/s10957-012-0015-8
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
In this paper, we show that a discounted continuous-time Markov decision process in Borel spaces with randomized history-dependent policies, arbitrarily unbounded transition rates and a non-negative reward rate is equivalent to a discrete-time Markov decision process. Based on a completely new proof, which does not involve Kolmogorov's forward equation, it is shown that the value function for both models is given by the minimal non-negative solution to the same Bellman equation. A verifiable necessary and sufficient condition for the finiteness of this value function is given, which induces a new condition for the non-explosion of the underlying controlled process.
引用
收藏
页码:691 / 712
页数:22
相关论文
共 50 条
  • [1] The Transformation Method for Continuous-Time Markov Decision Processes
    Alexey Piunovskiy
    Yi Zhang
    Journal of Optimization Theory and Applications, 2012, 154 : 691 - 712
  • [2] Impulsive control for continuous-time Markov decision processes
    Université Bordeaux, IMB, INRIA Bordeaux Sud-Ouest, 200 Avenue de la Vieille Tour, Talence Cedex
    33405, France
    不详
    L69 7ZL, United Kingdom
    Adv Appl Probab, 1 (106-127):
  • [3] IMPULSIVE CONTROL FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES
    Dufour, Francois
    Piunovskiy, Alexei B.
    ADVANCES IN APPLIED PROBABILITY, 2015, 47 (01) : 106 - 127
  • [5] Continuous-Time Markov Decision Processes with Controlled Observations
    Huang, Yunhan
    Kavitha, Veeraruna
    Zhu, Quanyan
    2019 57TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2019, : 32 - 39
  • [6] Continuous-Time Markov Decision Processes with Exponential Utility
    Zhang, Yi
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2017, 55 (04) : 2636 - 2660
  • [7] Delayed Nondeterminism in Continuous-Time Markov Decision Processes
    Neuhaeusser, Martin R.
    Stoelinga, Marielle
    Katoen, Joost-Pieter
    FOUNDATIONS OF SOFTWARE SCIENCE AND COMPUTATIONAL STRUCTURES, PROCEEDINGS, 2009, 5504 : 364 - +
  • [8] Sufficiency of Markov Policies for Continuous-Time Jump Markov Decision Processes
    Feinberg, Eugene A.
    Mandava, Manasa
    Shiryaev, Albert N.
    MATHEMATICS OF OPERATIONS RESEARCH, 2022, 47 (02) : 1266 - 1286
  • [9] Constrained continuous-time Markov decision processes with average criteria
    Lanlan Zhang
    Xianping Guo
    Mathematical Methods of Operations Research, 2008, 67 : 323 - 340
  • [10] Constrained continuous-time Markov decision processes with average criteria
    Zhang, Lanlan
    Guo, Xianping
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2008, 67 (02) : 323 - 340