REALIZABLE STRATEGIES IN CONTINUOUS-TIME MARKOV DECISION PROCESSES

被引：2

作者：

Piunovskiy, Alexey ^{[1
]}

机构：

[1] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England

来源：

SIAM JOURNAL ON CONTROL AND OPTIMIZATION | 2018年 / 56卷 / 01期

关键词：

continuous-time Markov decision process; total cost; discounted cost; relaxed strategy; randomized strategy; MODELS;

D O I：

10.1137/17M1138959

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

For the Borel model of the continuous-time Markov decision process, we introduce a wide class of control strategies. In a particular case, such strategies transform to the standard relaxed strategies, intensively studied in the last decade. In another special case, if one restricts to another special subclass of the general strategies, the model transforms to the semi-Markov decision process. Further, we show that the relaxed strategies are not realizable. For the constrained optimal control problem with total expected costs, we describe the sufficient class of realizable strategies, the so-called Poisson-related strategies. Finally, we show that, for solving the formulated optimal control problems, one can use all the tools developed earlier for the classical discrete-time Markov decision processes.

引用

页码：473 / 495

页数：23

共 50 条

[1] RANDOMIZED AND RELAXED STRATEGIES IN CONTINUOUS-TIME MARKOV DECISION PROCESSES
Piunovskiy, Alexey
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2015, 53 (06) : 3503 - 3533
[2] Impulsive control for continuous-time Markov decision processes
Université Bordeaux, IMB, INRIA Bordeaux Sud-Ouest, 200 Avenue de la Vieille Tour, Talence Cedex
33405, France
不详
L69 7ZL, United Kingdom
Adv Appl Probab, 1 (106-127): : 106 - 127
[3] IMPULSIVE CONTROL FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES
Dufour, Francois
Piunovskiy, Alexei B.
ADVANCES IN APPLIED PROBABILITY, 2015, 47 (01) : 106 - 127
[4] The Transformation Method for Continuous-Time Markov Decision Processes
Piunovskiy, Alexey
Zhang, Yi
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 154 (02) : 691 - 712
[5] The Transformation Method for Continuous-Time Markov Decision Processes
Alexey Piunovskiy
Yi Zhang
Journal of Optimization Theory and Applications, 2012, 154 : 691 - 712
[6] Sufficiency of Markov Policies for Continuous-Time Jump Markov Decision Processes
Feinberg, Eugene A.
Mandava, Manasa
Shiryaev, Albert N.
MATHEMATICS OF OPERATIONS RESEARCH, 2022, 47 (02) : 1266 - 1286
[7] DISCOUNTED CONTINUOUS-TIME CONSTRAINED MARKOV DECISION PROCESSES IN POLISH SPACES
Guo, Xianping
Song, Xinyuan
ANNALS OF APPLIED PROBABILITY, 2011, 21 (05) : 2016 - 2049
[8] Constrained continuous-time Markov decision processes with average criteria
Zhang, Lanlan
Guo, Xianping
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2008, 67 (02) : 323 - 340
[9] Constrained Continuous-Time Markov Decision Processes on the Finite Horizon
Guo, Xianping
Huang, Yonghui
Zhang, Yi
APPLIED MATHEMATICS AND OPTIMIZATION, 2017, 75 (02) : 317 - 341
[10] Bias optimality for multichain continuous-time Markov decision processes
Guo, Xianping
Song, XinYuan
Zhang, Junyu
OPERATIONS RESEARCH LETTERS, 2009, 37 (05) : 317 - 321

← 1 2 3 4 5 →