Impulsive Control for Continuous-Time Markov Decision Processes: A Linear Programming Approach

被引:11
作者
Dufour, F. [1 ,2 ,3 ]
Piunovskiy, A. B. [4 ]
机构
[1] Bordeaux INP, IMB, UMR CNRS 5251, Talence, France
[2] Univ Bordeaux, UMR CNRS 5251, IMB, Talence, France
[3] Inria Bordeaux Sud Ouest, 200 Ave Vieille Tour, F-33405 Talence, France
[4] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England
基金
英国工程与自然科学研究理事会;
关键词
Impulsive control; Continuous control; Continuous-time Markov decision process; Linear programming approach; Discounted cost; DRIFT PROCESSES;
D O I
10.1007/s00245-015-9310-8
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this paper, we investigate an optimization problem for continuous-time Markov decision processes with both impulsive and continuous controls. We consider the so-called constrained problem where the objective of the controller is to minimize a total expected discounted optimality criterion associated with a cost rate function while keeping other performance criteria of the same form, but associated with different cost rate functions, below some given bounds. Our model allows multiple impulses at the same time moment. The main objective of this work is to study the associated linear program defined on a space of measures including the occupation measures of the controlled process and to provide sufficient conditions to ensure the existence of an optimal control.
引用
收藏
页码:129 / 161
页数:33
相关论文
共 32 条
[11]  
Guo X, 2009, STOCH MOD APPL PROBA, V62, P1, DOI 10.1007/978-3-642-02547-1
[12]   A survey of recent results on continuous-time Markov decision processes [J].
Guo, Xianping ;
Hernandez-Lerma, Onesimo ;
Prieto-Rumeau, Tomas .
TOP, 2006, 14 (02) :177-243
[13]   Discounted Continuous-Time Markov Decision Processes with Constraints: Unbounded Transition and Loss Rates [J].
Guo, Xianping ;
Piunovskiy, Alexei .
MATHEMATICS OF OPERATIONS RESEARCH, 2011, 36 (01) :105-132
[14]  
Helmes K., 2007, Stochastics, V79, P309, DOI DOI 10.1080/17442500600976814
[15]  
Hernandez-Lerma O., 1996, APPL MATH, V30
[16]  
HERNANDEZLERMA O, 1999, APPL MATH, V42
[17]   MARKOV DECISION DRIFT PROCESSES - CONDITIONS FOR OPTIMALITY OBTAINED BY DISCRETIZATION [J].
HORDIJK, A ;
SCHOUTEN, FV .
MATHEMATICS OF OPERATIONS RESEARCH, 1985, 10 (01) :160-173
[18]   AVERAGE OPTIMAL POLICIES IN MARKOV DECISION DRIFT PROCESSES WITH APPLICATIONS TO A QUEUING AND A REPLACEMENT MODEL [J].
HORDIJK, A ;
SCHOUTEN, FAV .
ADVANCES IN APPLIED PROBABILITY, 1983, 15 (02) :274-303
[19]  
Jacod J., 1974, GEBIETE, V31, P235
[20]  
Jacod J., 1979, Lecture Notes in Mathematics, V714