Constrained Continuous-Time Markov Decision Processes on the Finite Horizon

被引:4
作者
Guo, Xianping [1 ]
Huang, Yonghui [1 ]
Zhang, Yi [2 ]
机构
[1] Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China
[2] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England
关键词
Continuous-time Markov decision process; Constrained-optimality; Finite horizon; Mixture of N+1 deterministic Markov policies; Occupation measure;
D O I
10.1007/s00245-016-9352-6
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This paper studies the constrained (nonhomogeneous) continuous-time Markov decision processes on the finite horizon. The performance criterion to be optimized is the expected total reward on the finite horizon, while N constraints are imposed on similar expected costs. Introducing the appropriate notion of the occupation measures for the concerned optimal control problem, we establish the following under some suitable conditions: (a) the class of Markov policies is sufficient; (b) every extreme point of the space of performance vectors is generated by a deterministic Markov policy; and (c) there exists an optimal Markov policy, which is a mixture of no more than N + 1 deterministic Markov policies.
引用
收藏
页码:317 / 341
页数:25
相关论文
共 29 条
  • [1] [Anonymous], 1999, STOCH MODEL SER, DOI 10.1201/9781315140223
  • [2] [Anonymous], 2012, CONTINUOUS TIME CONT
  • [3] [Anonymous], 1995, Controlled Queueing Systems
  • [4] Infinite horizon optimal impulsive control with applications to Internet congestion control
    Avrachenkov, Konstantin
    Habachi, Oussama
    Piunovskiy, Alexey
    Zhang, Yi
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2015, 88 (04) : 703 - 716
  • [5] Bäuerle N, 2011, UNIVERSITEXT, P1, DOI 10.1007/978-3-642-18324-9
  • [6] Continuous time discounted jump Markov decision processes: A discrete-event approach
    Feinberg, EA
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2004, 29 (03) : 492 - 524
  • [7] On solutions of Kolmogorov's equations for nonhomogeneous jump Markov processes
    Feinberg, Eugene A.
    Mandava, Manasa
    Shiryaev, Albert N.
    [J]. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2014, 411 (01) : 261 - 270
  • [8] Splitting Randomized Stationary Policies in Total-Reward Markov Decision Processes
    Feinberg, Eugene A.
    Rothblum, Uriel G.
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2012, 37 (01) : 129 - 153
  • [9] FINITE-HORIZON OPTIMALITY FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH UNBOUNDED TRANSITION RATES
    Guo, Xianping
    Huang, Xiangxiang
    Huang, Yonghui
    [J]. ADVANCES IN APPLIED PROBABILITY, 2015, 47 (04) : 1064 - 1087
  • [10] Guo XP, 2013, ADV APPL PROBAB, V45, P490