Policy iteration for average cost Markov control processes on Borel spaces

被引：15

作者：

HernandezLerma, O ^{[1
]}

Lasserre, JB ^{[1
]}

机构：

[1] CNRS,LAAS,F-31077 TOULOUSE,FRANCE

来源：

ACTA APPLICANDAE MATHEMATICAE | 1997年 / 47卷 / 02期

关键词：

(discrete-time) Markov control processes; average cost; policy iteration (aka Howard's algorithm);

D O I：

10.1023/A:1005781013253

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

This paper studies the policy iteration algorithm (PIA) for average cost Markov control processes on Borel spaces. Two classes of MCPs are considered. One of them allows some restricted-growth unbounded cost functions and compact control constraint sets; the other one requires strictly unbounded costs and the control constraint sets may be non-compact. For each of these classes, the PIA yields, under suitable assumptions, the optimal (minimum) cost, an optimal stationary control policy, and a solution to the average cost optimality equation.

引用

收藏

页码：125 / 154

页数：30

相关论文

共 50 条

[31] Markov control processes with pathwise constraints [J].

Armando F. Mendoza-Pérez ;

Onésimo Hernández-Lerma .

Mathematical Methods of Operations Research, 2010, 71 :477-502

[32] The average cost of Markov chains subject to total variation distance uncertainty [J].

Malikopoulos, A. A. ;

Charalambous, C. D. ;

Tzortzis, I. .

SYSTEMS & CONTROL LETTERS, 2018, 120 :29-35

[33] Denumerable continuous-time Markov decision processes with multiconstraints on average costs [J].

Liu, Qiuli ;

Tan, Hangsheng ;

Guo, Xianping .

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2012, 43 (03) :576-585

[34] SUBSTANTIATING THE OPTIMAL DISTRIBUTION POLICY USING MARKOV DECISION PROCESSES [J].

Basanu, Gheorghe ;

Teleasa, Victor .

MANAGEMENT RESEARCH AND PRACTICE, 2010, 2 (04) :362-370

[35] Optimal control in light traffic Markov decision processes [J].

Ger Koole ;

Olaf Passchier .

Mathematical Methods of Operations Research, 1997, 45 :63-79

[36] Optimal control in light traffic Markov decision processes [J].

Koole, G ;

Passchier, O .

MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 1997, 45 (01) :63-79

[37] Envelopes of sets of measures, tightness, and Markov control processes [J].

González-Hernández, J ;

Hernández-Lerma, O .

APPLIED MATHEMATICS AND OPTIMIZATION, 1999, 40 (03) :377-392

[38] ON THE MODELING OF IMPULSE CONTROL WITH RANDOM EFFECTS FOR CONTINUOUS MARKOV PROCESSES [J].

Helmes, Kurt l. ;

Stockbridge, Richard h. ;

Zhu, Chao .

SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2024, 62 (01) :699-723

[39] Monotonicity of minimizers in optimization problems with applications to Markov control processes [J].

Flores-Hernandez, Rosa M. ;

Montes-De-Oca, Raul .

KYBERNETIKA, 2007, 43 (03) :347-368

[40] ''Super-overtaking'' optimal policies for Markov control processes [J].

Gordienko, E .

SYSTEMS & CONTROL LETTERS, 1997, 31 (01) :59-64

← 1 2 3 4 5 →