The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes

被引:3
|
作者
Costa, O. L. V. [1 ]
Dufour, F. [2 ]
机构
[1] Univ Sao Paulo, Escola Politecn, Dept Engn Telecomunicacoes & Controle, BR-05508900 Sao Paulo, Brazil
[2] Univ Bordeaux 1, IMB, Team CQFD, INRIA Bordeaux Sud Ouest, F-33405 Talence, France
来源
APPLIED MATHEMATICS AND OPTIMIZATION | 2010年 / 62卷 / 02期
关键词
Piecewise-deterministic Markov Processes; Continuous-time; Long-run average cost; Optimal control; Integro-differential optimality inequation; Policy iteration algorithm; DECISION-PROCESSES; OPTIMALITY;
D O I
10.1007/s00245-010-9099-4
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The main goal of this paper is to apply the so-called policy iteration algorithm (PIA) for the long run average continuous control problem of piecewise deterministic Markov processes (PDMP's) taking values in a general Borel space and with compact action space depending on the state variable. In order to do that we first derive some important properties for a pseudo-Poisson equation associated to the problem. In the sequence it is shown that the convergence of the PIA to a solution satisfying the optimality equation holds under some classical hypotheses and that this optimal solution yields to an optimal control strategy for the average control problem for the continuous-time PDMP in a feedback form.
引用
收藏
页码:185 / 204
页数:20
相关论文
共 50 条
  • [31] Numerical method for impulse control of piecewise deterministic Markov processes
    de Saporta, Benoite
    Dufour, Francois
    AUTOMATICA, 2012, 48 (05) : 779 - 793
  • [32] Communicating piecewise deterministic Markov processes
    Strubbe, Stefan
    van der Schaft, Arjan
    STOCHASTIC HYBRID SYSTEMS: THEORY AND SAFETY CRITICAL APPLICATIONS, 2006, 337 : 65 - 104
  • [33] Piecewise-deterministic Markov processes
    Kazak, Jolanta
    ANNALES POLONICI MATHEMATICI, 2013, 109 (03) : 279 - 296
  • [34] PIECEWISE DETERMINISTIC MARKOV-PROCESSES
    CAI, HY
    STOCHASTIC ANALYSIS AND APPLICATIONS, 1993, 11 (03) : 255 - 274
  • [35] Policy Iteration for Decentralized Control of Markov Decision Processes
    Bernstein, Daniel S.
    Amato, Christopher
    Hansen, Eric A.
    Zilberstein, Shlomo
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2009, 34 : 89 - 132
  • [36] PIECEWISE-DETERMINISTIC MARKOV PROCESSES AS LIMITS OF MARKOV JUMP PROCESSES
    Franz, Uwe
    Liebscher, Volkmar
    Zeiser, Stefan
    ADVANCES IN APPLIED PROBABILITY, 2012, 44 (03) : 729 - 748
  • [37] CONSTRAINED AND UNCONSTRAINED OPTIMAL DISCOUNTED CONTROL OF PIECEWISE DETERMINISTIC MARKOV PROCESSES
    Costa, O. L. V.
    Dufour, F.
    Piunovskiy, A. B.
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2016, 54 (03) : 1444 - 1474
  • [38] Piecewise deterministic Markov control processes with feedback controls and unbounded costs
    Forwick, L
    Schäl, M
    Schmitz, M
    ACTA APPLICANDAE MATHEMATICAE, 2004, 82 (03) : 239 - 267
  • [39] Piecewise Deterministic Markov Control Processes with Feedback Controls and Unbounded Costs
    Lothar Forwick
    Manfred Schäl
    Michael Schmitz
    Acta Applicandae Mathematica, 2004, 82 : 239 - 267
  • [40] Piecewise Deterministic Markov Processes in Biological Models
    Rudnicki, Ryszard
    Tyran-Kaminska, Marta
    SEMIGROUPS OF OPERATORS - THEORY AND APPLICATIONS, 2015, 113 : 235 - 255