The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes

被引：3

作者：

Costa, O. L. V. ^{[1
]}

Dufour, F. ^{[2
]}

机构：

[1] Univ Sao Paulo, Escola Politecn, Dept Engn Telecomunicacoes & Controle, BR-05508900 Sao Paulo, Brazil

[2] Univ Bordeaux 1, IMB, Team CQFD, INRIA Bordeaux Sud Ouest, F-33405 Talence, France

来源：

APPLIED MATHEMATICS AND OPTIMIZATION | 2010年 / 62卷 / 02期

关键词：

Piecewise-deterministic Markov Processes; Continuous-time; Long-run average cost; Optimal control; Integro-differential optimality inequation; Policy iteration algorithm; DECISION-PROCESSES; OPTIMALITY;

D O I：

10.1007/s00245-010-9099-4

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

The main goal of this paper is to apply the so-called policy iteration algorithm (PIA) for the long run average continuous control problem of piecewise deterministic Markov processes (PDMP's) taking values in a general Borel space and with compact action space depending on the state variable. In order to do that we first derive some important properties for a pseudo-Poisson equation associated to the problem. In the sequence it is shown that the convergence of the PIA to a solution satisfying the optimality equation holds under some classical hypotheses and that this optimal solution yields to an optimal control strategy for the average control problem for the continuous-time PDMP in a feedback form.

引用

页码：185 / 204

页数：20

共 50 条

[31] Numerical method for impulse control of piecewise deterministic Markov processes
de Saporta, Benoite
Dufour, Francois
AUTOMATICA, 2012, 48 (05) : 779 - 793
[32] Communicating piecewise deterministic Markov processes
Strubbe, Stefan
van der Schaft, Arjan
STOCHASTIC HYBRID SYSTEMS: THEORY AND SAFETY CRITICAL APPLICATIONS, 2006, 337 : 65 - 104
[33] Piecewise-deterministic Markov processes
Kazak, Jolanta
ANNALES POLONICI MATHEMATICI, 2013, 109 (03) : 279 - 296
[34] PIECEWISE DETERMINISTIC MARKOV-PROCESSES
CAI, HY
STOCHASTIC ANALYSIS AND APPLICATIONS, 1993, 11 (03) : 255 - 274
[35] Policy Iteration for Decentralized Control of Markov Decision Processes
Bernstein, Daniel S.
Amato, Christopher
Hansen, Eric A.
Zilberstein, Shlomo
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2009, 34 : 89 - 132
[36] PIECEWISE-DETERMINISTIC MARKOV PROCESSES AS LIMITS OF MARKOV JUMP PROCESSES
Franz, Uwe
Liebscher, Volkmar
Zeiser, Stefan
ADVANCES IN APPLIED PROBABILITY, 2012, 44 (03) : 729 - 748
[37] CONSTRAINED AND UNCONSTRAINED OPTIMAL DISCOUNTED CONTROL OF PIECEWISE DETERMINISTIC MARKOV PROCESSES
Costa, O. L. V.
Dufour, F.
Piunovskiy, A. B.
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2016, 54 (03) : 1444 - 1474
[38] Piecewise deterministic Markov control processes with feedback controls and unbounded costs
Forwick, L
Schäl, M
Schmitz, M
ACTA APPLICANDAE MATHEMATICAE, 2004, 82 (03) : 239 - 267
[39] Piecewise Deterministic Markov Control Processes with Feedback Controls and Unbounded Costs
Lothar Forwick
Manfred Schäl
Michael Schmitz
Acta Applicandae Mathematica, 2004, 82 : 239 - 267
[40] Piecewise Deterministic Markov Processes in Biological Models
Rudnicki, Ryszard
Tyran-Kaminska, Marta
SEMIGROUPS OF OPERATORS - THEORY AND APPLICATIONS, 2015, 113 : 235 - 255

← 1 2 3 4 5 →