Policy iteration for average cost Markov control processes on Borel spaces

被引:15
作者
HernandezLerma, O [1 ]
Lasserre, JB [1 ]
机构
[1] CNRS,LAAS,F-31077 TOULOUSE,FRANCE
关键词
(discrete-time) Markov control processes; average cost; policy iteration (aka Howard's algorithm);
D O I
10.1023/A:1005781013253
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This paper studies the policy iteration algorithm (PIA) for average cost Markov control processes on Borel spaces. Two classes of MCPs are considered. One of them allows some restricted-growth unbounded cost functions and compact control constraint sets; the other one requires strictly unbounded costs and the control constraint sets may be non-compact. For each of these classes, the PIA yields, under suitable assumptions, the optimal (minimum) cost, an optimal stationary control policy, and a solution to the average cost optimality equation.
引用
收藏
页码:125 / 154
页数:30
相关论文
共 50 条
[31]   Markov control processes with pathwise constraints [J].
Armando F. Mendoza-Pérez ;
Onésimo Hernández-Lerma .
Mathematical Methods of Operations Research, 2010, 71 :477-502
[32]   The average cost of Markov chains subject to total variation distance uncertainty [J].
Malikopoulos, A. A. ;
Charalambous, C. D. ;
Tzortzis, I. .
SYSTEMS & CONTROL LETTERS, 2018, 120 :29-35
[33]   Denumerable continuous-time Markov decision processes with multiconstraints on average costs [J].
Liu, Qiuli ;
Tan, Hangsheng ;
Guo, Xianping .
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2012, 43 (03) :576-585
[34]   SUBSTANTIATING THE OPTIMAL DISTRIBUTION POLICY USING MARKOV DECISION PROCESSES [J].
Basanu, Gheorghe ;
Teleasa, Victor .
MANAGEMENT RESEARCH AND PRACTICE, 2010, 2 (04) :362-370
[35]   Optimal control in light traffic Markov decision processes [J].
Ger Koole ;
Olaf Passchier .
Mathematical Methods of Operations Research, 1997, 45 :63-79
[36]   Optimal control in light traffic Markov decision processes [J].
Koole, G ;
Passchier, O .
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 1997, 45 (01) :63-79
[37]   Envelopes of sets of measures, tightness, and Markov control processes [J].
González-Hernández, J ;
Hernández-Lerma, O .
APPLIED MATHEMATICS AND OPTIMIZATION, 1999, 40 (03) :377-392
[38]   ON THE MODELING OF IMPULSE CONTROL WITH RANDOM EFFECTS FOR CONTINUOUS MARKOV PROCESSES [J].
Helmes, Kurt l. ;
Stockbridge, Richard h. ;
Zhu, Chao .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2024, 62 (01) :699-723
[39]   Monotonicity of minimizers in optimization problems with applications to Markov control processes [J].
Flores-Hernandez, Rosa M. ;
Montes-De-Oca, Raul .
KYBERNETIKA, 2007, 43 (03) :347-368
[40]   ''Super-overtaking'' optimal policies for Markov control processes [J].
Gordienko, E .
SYSTEMS & CONTROL LETTERS, 1997, 31 (01) :59-64