REALIZABLE STRATEGIES IN CONTINUOUS-TIME MARKOV DECISION PROCESSES

被引:2
|
作者
Piunovskiy, Alexey [1 ]
机构
[1] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England
关键词
continuous-time Markov decision process; total cost; discounted cost; relaxed strategy; randomized strategy; MODELS;
D O I
10.1137/17M1138959
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For the Borel model of the continuous-time Markov decision process, we introduce a wide class of control strategies. In a particular case, such strategies transform to the standard relaxed strategies, intensively studied in the last decade. In another special case, if one restricts to another special subclass of the general strategies, the model transforms to the semi-Markov decision process. Further, we show that the relaxed strategies are not realizable. For the constrained optimal control problem with total expected costs, we describe the sufficient class of realizable strategies, the so-called Poisson-related strategies. Finally, we show that, for solving the formulated optimal control problems, one can use all the tools developed earlier for the classical discrete-time Markov decision processes.
引用
收藏
页码:473 / 495
页数:23
相关论文
共 50 条
  • [2] Impulsive control for continuous-time Markov decision processes
    Université Bordeaux, IMB, INRIA Bordeaux Sud-Ouest, 200 Avenue de la Vieille Tour, Talence Cedex
    33405, France
    不详
    L69 7ZL, United Kingdom
    Adv Appl Probab, 1 (106-127): : 106 - 127
  • [3] IMPULSIVE CONTROL FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES
    Dufour, Francois
    Piunovskiy, Alexei B.
    ADVANCES IN APPLIED PROBABILITY, 2015, 47 (01) : 106 - 127
  • [4] The Transformation Method for Continuous-Time Markov Decision Processes
    Piunovskiy, Alexey
    Zhang, Yi
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 154 (02) : 691 - 712
  • [5] The Transformation Method for Continuous-Time Markov Decision Processes
    Alexey Piunovskiy
    Yi Zhang
    Journal of Optimization Theory and Applications, 2012, 154 : 691 - 712
  • [6] Sufficiency of Markov Policies for Continuous-Time Jump Markov Decision Processes
    Feinberg, Eugene A.
    Mandava, Manasa
    Shiryaev, Albert N.
    MATHEMATICS OF OPERATIONS RESEARCH, 2022, 47 (02) : 1266 - 1286
  • [7] DISCOUNTED CONTINUOUS-TIME CONSTRAINED MARKOV DECISION PROCESSES IN POLISH SPACES
    Guo, Xianping
    Song, Xinyuan
    ANNALS OF APPLIED PROBABILITY, 2011, 21 (05) : 2016 - 2049
  • [8] Constrained continuous-time Markov decision processes with average criteria
    Zhang, Lanlan
    Guo, Xianping
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2008, 67 (02) : 323 - 340
  • [9] Constrained Continuous-Time Markov Decision Processes on the Finite Horizon
    Guo, Xianping
    Huang, Yonghui
    Zhang, Yi
    APPLIED MATHEMATICS AND OPTIMIZATION, 2017, 75 (02) : 317 - 341
  • [10] Bias optimality for multichain continuous-time Markov decision processes
    Guo, Xianping
    Song, XinYuan
    Zhang, Junyu
    OPERATIONS RESEARCH LETTERS, 2009, 37 (05) : 317 - 321