IMPULSIVE CONTROL FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES

被引:0
|
作者
Dufour, Francois [1 ,2 ]
Piunovskiy, Alexei B. [3 ]
机构
[1] Univ Bordeaux, IMB, Bordeaux, France
[2] INRIA Bordeaux Sud Quest, F-33405 Talence, France
[3] Univ Liverpool, Liverpool L69 3BX, Merseyside, England
基金
英国工程与自然科学研究理事会;
关键词
Impulsive control; continuous control; continuous-time Markov decision process; discounted cost; DRIFT PROCESSES;
D O I
暂无
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper our objective is to study continuous-time Markov decision processes on a general Borel state space with both impulsive and continuous controls for the infinite time horizon discounted cost. The continuous-time controlled process is shown to be nonexplosive under appropriate hypotheses. The so-called Bellman equation associated to this control problem is studied. Sufficient conditions ensuring the existence and the uniqueness of a bounded measurable solution to this optimality equation are provided. Moreover, it is shown that the value function of the optimization problem under consideration satisfies this optimality equation. Sufficient conditions are also presented to ensure on the one hand the existence of an optimal control strategy, and on the other hand the existence of a epsilon-optimal control strategy. The decomposition of the state space into two disjoint subsets is exhibited where, roughly speaking, one should apply a gradual action or an impulsive action correspondingly to obtain an optimal or epsilon-optimal strategy. An interesting consequence of our previous results is as follows: the set of strategies that allow interventions, at time t = 0 and only immediately after natural jumps is a sufficient set for the control problem tinder consideraion.
引用
收藏
页码:106 / 127
页数:22
相关论文
共 50 条
  • [11] Sufficiency of Markov Policies for Continuous-Time Jump Markov Decision Processes
    Feinberg, Eugene A.
    Mandava, Manasa
    Shiryaev, Albert N.
    MATHEMATICS OF OPERATIONS RESEARCH, 2022, 47 (02) : 1266 - 1286
  • [12] MARKOV DECISION-PROCESSES WITH BOTH CONTINUOUS AND IMPULSIVE CONTROL
    YUSHKEVICH, AA
    LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1986, 81 : 234 - 246
  • [13] Constrained continuous-time Markov decision processes with average criteria
    Lanlan Zhang
    Xianping Guo
    Mathematical Methods of Operations Research, 2008, 67 : 323 - 340
  • [14] Constrained continuous-time Markov decision processes with average criteria
    Zhang, Lanlan
    Guo, Xianping
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2008, 67 (02) : 323 - 340
  • [15] Constrained Continuous-Time Markov Decision Processes on the Finite Horizon
    Xianping Guo
    Yonghui Huang
    Yi Zhang
    Applied Mathematics & Optimization, 2017, 75 : 317 - 341
  • [16] Bisimulation and logical preservation for continuous-time Markov decision processes
    Neuhaeusser, Martin R.
    Katoen, Joost-Pieter
    CONCUR 2007 - CONCURRENCY THEORY, PROCEEDINGS, 2007, 4703 : 412 - +
  • [17] Bisimulations and Logical Characterizations on Continuous-Time Markov Decision Processes
    Song, Lei
    Zhang, Lijun
    Godskesen, Jens Chr.
    VERIFICATION, MODEL CHECKING, AND ABSTRACT INTERPRETATION: (VMCAI 2014), 2014, 8318 : 98 - 117
  • [18] Bias optimality for multichain continuous-time Markov decision processes
    Guo, Xianping
    Song, XinYuan
    Zhang, Junyu
    OPERATIONS RESEARCH LETTERS, 2009, 37 (05) : 317 - 321
  • [19] A survey of recent results on continuous-time Markov decision processes
    Guo, Xianping
    Hernandez-Lerma, Onesimo
    Prieto-Rumeau, Tomas
    TOP, 2006, 14 (02) : 177 - 243