Policy iteration for average cost Markov control processes on Borel spaces

被引:15
作者
HernandezLerma, O [1 ]
Lasserre, JB [1 ]
机构
[1] CNRS,LAAS,F-31077 TOULOUSE,FRANCE
关键词
(discrete-time) Markov control processes; average cost; policy iteration (aka Howard's algorithm);
D O I
10.1023/A:1005781013253
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This paper studies the policy iteration algorithm (PIA) for average cost Markov control processes on Borel spaces. Two classes of MCPs are considered. One of them allows some restricted-growth unbounded cost functions and compact control constraint sets; the other one requires strictly unbounded costs and the control constraint sets may be non-compact. For each of these classes, the PIA yields, under suitable assumptions, the optimal (minimum) cost, an optimal stationary control policy, and a solution to the average cost optimality equation.
引用
收藏
页码:125 / 154
页数:30
相关论文
共 50 条
[41]   Risk sensitive control of Markov processes in countable state space [J].
HernandezHernandez, D ;
Marcus, SI .
SYSTEMS & CONTROL LETTERS, 1996, 29 (03) :147-155
[42]   Variance-minimization of Markov control processes with pathwise constraints [J].
Mendoza-Perez, Armando F. ;
Hernandez-Lerma, Onesimo .
OPTIMIZATION, 2012, 61 (12) :1427-1447
[43]   Structural properties for a two-state partially observable Markov decision process with an average cost criterion [J].
Goulionis, John E. .
JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS, 2007, 10 (05) :715-733
[44]   Discrete-time average-cost mean-field games on Polish spaces [J].
Saldi, Naci .
TURKISH JOURNAL OF MATHEMATICS, 2020, 44 (02) :463-480
[45]   Strong polynomiality of policy iterations for average-cost MDPs modeling replacement and maintenance problems [J].
Feinberg, Eugene A. ;
Huang, Jefferson .
OPERATIONS RESEARCH LETTERS, 2013, 41 (03) :249-251
[46]   A survey of average cost problems in deterministic discrete-time control systems [J].
Hernandez-Lerma, Onesimo ;
Laura-Guarachi, Leonardo R. ;
Mendoza-Palacios, Saul .
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2023, 522 (01)
[47]   Stationary policies for lower bounds on the minimum average cost of discrete-time nonlinear control systems [J].
Vargas, Alessandro N. ;
Ishihara, Joao Y. ;
do Val, Joao B. R. .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2014, 24 (17) :2943-2957
[48]   Continuous-Time Markov Decision Processes Under the Risk-Sensitive First Passage Discounted Cost Criterion [J].
Wei, Qingda ;
Chen, Xian .
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2023, 197 (01) :309-333
[49]   Bias Optimality versus Strong 0-Discount Optimality in Markov Control Processes with Unbounded Costs [J].
Nadine Hilgert ;
Onésimo Hernández-Lerma .
Acta Applicandae Mathematica, 2003, 77 :215-235
[50]   Bias optimality versus strong 0-discount optimality in Markov control processes with unbounded costs [J].
Hilgert, N ;
Hernández-Lerma, O .
ACTA APPLICANDAE MATHEMATICAE, 2003, 77 (03) :215-235