Policy iteration for average cost Markov control processes on Borel spaces

被引：15

作者：

HernandezLerma, O ^{[1
]}

Lasserre, JB ^{[1
]}

机构：

[1] CNRS,LAAS,F-31077 TOULOUSE,FRANCE

来源：

ACTA APPLICANDAE MATHEMATICAE | 1997年 / 47卷 / 02期

关键词：

(discrete-time) Markov control processes; average cost; policy iteration (aka Howard's algorithm);

D O I：

10.1023/A:1005781013253

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

This paper studies the policy iteration algorithm (PIA) for average cost Markov control processes on Borel spaces. Two classes of MCPs are considered. One of them allows some restricted-growth unbounded cost functions and compact control constraint sets; the other one requires strictly unbounded costs and the control constraint sets may be non-compact. For each of these classes, the PIA yields, under suitable assumptions, the optimal (minimum) cost, an optimal stationary control policy, and a solution to the average cost optimality equation.

引用

页码：125 / 154

页数：30

共 50 条

[41] Risk sensitive control of Markov processes in countable state space [J].

HernandezHernandez, D ;

Marcus, SI .

SYSTEMS & CONTROL LETTERS, 1996, 29 (03) :147-155

[42] Variance-minimization of Markov control processes with pathwise constraints [J].

Mendoza-Perez, Armando F. ;

Hernandez-Lerma, Onesimo .

OPTIMIZATION, 2012, 61 (12) :1427-1447

[43] Structural properties for a two-state partially observable Markov decision process with an average cost criterion [J].

Goulionis, John E. .

JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS, 2007, 10 (05) :715-733

[44] Discrete-time average-cost mean-field games on Polish spaces [J].

Saldi, Naci .

TURKISH JOURNAL OF MATHEMATICS, 2020, 44 (02) :463-480

[45] Strong polynomiality of policy iterations for average-cost MDPs modeling replacement and maintenance problems [J].

Feinberg, Eugene A. ;

Huang, Jefferson .

OPERATIONS RESEARCH LETTERS, 2013, 41 (03) :249-251

[46] A survey of average cost problems in deterministic discrete-time control systems [J].

Hernandez-Lerma, Onesimo ;

Laura-Guarachi, Leonardo R. ;

Mendoza-Palacios, Saul .

JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2023, 522 (01)

[47] Stationary policies for lower bounds on the minimum average cost of discrete-time nonlinear control systems [J].

Vargas, Alessandro N. ;

Ishihara, Joao Y. ;

do Val, Joao B. R. .

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2014, 24 (17) :2943-2957

[48] Continuous-Time Markov Decision Processes Under the Risk-Sensitive First Passage Discounted Cost Criterion [J].

Wei, Qingda ;

Chen, Xian .

JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2023, 197 (01) :309-333

[49] Bias Optimality versus Strong 0-Discount Optimality in Markov Control Processes with Unbounded Costs [J].

Nadine Hilgert ;

Onésimo Hernández-Lerma .

Acta Applicandae Mathematica, 2003, 77 :215-235

[50] Bias optimality versus strong 0-discount optimality in Markov control processes with unbounded costs [J].

Hilgert, N ;

Hernández-Lerma, O .

ACTA APPLICANDAE MATHEMATICAE, 2003, 77 (03) :215-235

← 1 2 3 4 5 →