Sample-path average optimality for Markov control processes

被引：9

作者：

Lasserre, JB ^{[1
]}

机构：

[1] CNRS, LAAS, F-31077 Toulouse 4, France

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 1999年 / 44卷 / 10期

关键词：

borel spaces; Markov control (decision) processes; sample path average optimality;

D O I：

10.1109/9.793787

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The authors consider a Markov control process with Borel state and actions spaces, unbounded costs, and under the long-run sample-path average cost criterion. They prove that under very weak assumptions on the transition law and a moment assumption for the one-step cost, there exists a stationary policy with invariant probability distribution nu, that is sample-path average cost optimal for nu-almost all initial states. In addition, every expected average-cost optimal stationary policy is in fact (liminf) sample-path average-cost optimal and strongly expected average-cost optimal.

引用

页码：1966 / 1971

页数：6

共 50 条

[1] Sample-path optimality and variance-minimization of average cost Markov control processes
Hernández-Lerma, O
Vega-Amaya, O
Carrasco, G
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1999, 38 (01) : 79 - 93
[2] Sample-path optimality and variance-minimization of average cost Markov control processes
Hernández-Lerma, Onésimo
Vega-Amaya, Oscar
Carrasco, Guadalupe
SIAM Journal on Control and Optimization, 38 (01): : 79 - 93
[3] Average sample-path optimality for continuous-time Markov decision processes in Polish spaces
Zhu, Quan-xin
ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2011, 27 (04): : 613 - 624
[4] Average sample-path optimality for continuous-time Markov decision processes in Polish spaces
Quan-xin Zhu
Acta Mathematicae Applicatae Sinica, English Series, 2011, 27 : 613 - 624
[5] Sample-path and variance minimization of Markov control processes with average cost criteria
Hernández-Lerma, O
Vega-Amaya, O
Carrasco, G
PROCEEDINGS OF THE 39TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2000, : 1172 - 1176
[6] Sample-path optimality and variance-maximization for Markov decision processes
Q. X. Zhu
Mathematical Methods of Operations Research, 2007, 65 : 519 - 538
[7] Sample-path optimality and variance-maximization for Markov decision processes
Zhu, Q. X.
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2007, 65 (03) : 519 - 538
[8] A Counterexample on Sample-Path Optimality in Stable Markov Decision Chains with the Average Reward Criterion
Rolando Cavazos-Cadena
Raúl Montes-de-Oca
Karel Sladký
Journal of Optimization Theory and Applications, 2014, 163 : 674 - 684
[9] A Counterexample on Sample-Path Optimality in Stable Markov Decision Chains with the Average Reward Criterion
Cavazos-Cadena, Rolando
Montes-de-Oca, Raul
Sladky, Karel
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2014, 163 (02) : 674 - 684
[10] Another set of conditions for Markov decision processes with average sample-path costs
Zhu, Quanxin
Guo, Xianping
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2006, 322 (02) : 1199 - 1214

← 1 2 3 4 5 →