Solving generalized semi-Markov decision processes using continuous phase-type distributions

Authors:

Younes, HLS [1]

Simmons, RG [1]

Affiliation:

[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA

Source:

PROCEEDINGS OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2004

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We introduce the generalized semi-Markov decision process (GSMDP) as an extension of continuous-time MDPs and semi-Markov decision processes (SMDPs) for modeling stochastic decision processes with asynchronous events and actions. Using phase-type distributions and uniformization, we show how an arbitrary GSMDP can be approximated by a discrete-time MDP, which can then be solved using existing MDP techniques. The techniques we present can also be seen as an alternative approach for solving SMDPs, and we demonstrate that the introduction of phases allows us to generate higher quality policies than those obtained by standard SMDP solution techniques.
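
The uniformization step named in the abstract can be illustrated on its own. The sketch below is not the authors' implementation: it only shows the standard textbook construction that turns a continuous-time Markov chain with generator matrix Q into a discrete-time chain with transition matrix P = I + Q/q, where q is at least the largest exit rate. The function name `uniformize` and the two-state example are assumptions made for illustration.

```python
def uniformize(Q, rate=None):
    """Uniformize a CTMC generator matrix Q (list of lists).

    Returns (P, rate), where P = I + Q / rate is a discrete-time
    transition matrix. `rate` defaults to the largest exit rate
    max_i(-Q[i][i]); any larger value is also valid.
    Hypothetical helper for illustration only.
    """
    n = len(Q)
    max_exit = max(-Q[i][i] for i in range(n))
    if rate is None:
        rate = max_exit
    if rate <= 0 or rate < max_exit:
        raise ValueError("rate must be positive and >= the largest exit rate")
    # P[i][j] = delta_ij + Q[i][j] / rate
    P = [
        [(1.0 if i == j else 0.0) + Q[i][j] / rate for j in range(n)]
        for i in range(n)
    ]
    return P, rate


# Assumed two-state example: state 0 leaves at rate 2, state 1 at rate 1.
Q = [[-2.0, 2.0],
     [1.0, -1.0]]
P, q = uniformize(Q)
print(q)  # 2.0
print(P)  # [[0.0, 1.0], [0.5, 0.5]]
```

Each row of P sums to one, and the slower state gains a self-loop so that both states make transitions at the common rate q. In the paper's setting this step would apply after the non-exponential delays have been replaced by phase-type distributions; that phase construction is not shown here.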

Pages: 742-747

Page count: 6
