Constrained Continuous-Time Markov Decision Processes on the Finite Horizon

被引：4

作者：

Guo, Xianping ^{[1
]}

Huang, Yonghui ^{[1
]}

Zhang, Yi ^{[2
]}

机构：

[1] Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China

[2] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England

来源：

APPLIED MATHEMATICS AND OPTIMIZATION | 2017年 / 75卷 / 02期

关键词：

Continuous-time Markov decision process; Constrained-optimality; Finite horizon; Mixture of N+1 deterministic Markov policies; Occupation measure;

D O I：

10.1007/s00245-016-9352-6

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

This paper studies the constrained (nonhomogeneous) continuous-time Markov decision processes on the finite horizon. The performance criterion to be optimized is the expected total reward on the finite horizon, while N constraints are imposed on similar expected costs. Introducing the appropriate notion of the occupation measures for the concerned optimal control problem, we establish the following under some suitable conditions: (a) the class of Markov policies is sufficient; (b) every extreme point of the space of performance vectors is generated by a deterministic Markov policy; and (c) there exists an optimal Markov policy, which is a mixture of no more than N + 1 deterministic Markov policies.

引用

页码：317 / 341

页数：25

共 29 条

[1] [Anonymous], 1999, STOCH MODEL SER, DOI 10.1201/9781315140223
[2] [Anonymous], 2012, CONTINUOUS TIME CONT
[3] [Anonymous], 1995, Controlled Queueing Systems
[4] Infinite horizon optimal impulsive control with applications to Internet congestion control
Avrachenkov, Konstantin
Habachi, Oussama
Piunovskiy, Alexey
Zhang, Yi
[J]. INTERNATIONAL JOURNAL OF CONTROL, 2015, 88 (04) : 703 - 716
[5] Bäuerle N, 2011, UNIVERSITEXT, P1, DOI 10.1007/978-3-642-18324-9
[6] Continuous time discounted jump Markov decision processes: A discrete-event approach
Feinberg, EA
[J]. MATHEMATICS OF OPERATIONS RESEARCH, 2004, 29 (03) : 492 - 524
[7] On solutions of Kolmogorov's equations for nonhomogeneous jump Markov processes
Feinberg, Eugene A.
Mandava, Manasa
Shiryaev, Albert N.
[J]. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2014, 411 (01) : 261 - 270
[8] Splitting Randomized Stationary Policies in Total-Reward Markov Decision Processes
Feinberg, Eugene A.
Rothblum, Uriel G.
[J]. MATHEMATICS OF OPERATIONS RESEARCH, 2012, 37 (01) : 129 - 153
[9] FINITE-HORIZON OPTIMALITY FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED TRANSITION RATES
Guo, Xianping
Huang, Xiangxiang
Huang, Yonghui
[J]. ADVANCES IN APPLIED PROBABILITY, 2015, 47 (04) : 1064 - 1087
[10] Guo XP, 2013, ADV APPL PROBAB, V45, P490

← 1 2 3 →