Sample-path average optimality for Markov control processes

被引:9
|
作者
Lasserre, JB [1 ]
机构
[1] CNRS, LAAS, F-31077 Toulouse 4, France
关键词
borel spaces; Markov control (decision) processes; sample path average optimality;
D O I
10.1109/9.793787
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The authors consider a Markov control process with Borel state and actions spaces, unbounded costs, and under the long-run sample-path average cost criterion. They prove that under very weak assumptions on the transition law and a moment assumption for the one-step cost, there exists a stationary policy with invariant probability distribution nu, that is sample-path average cost optimal for nu-almost all initial states. In addition, every expected average-cost optimal stationary policy is in fact (liminf) sample-path average-cost optimal and strongly expected average-cost optimal.
引用
收藏
页码:1966 / 1971
页数:6
相关论文
共 50 条
  • [1] Sample-path optimality and variance-minimization of average cost Markov control processes
    Hernández-Lerma, O
    Vega-Amaya, O
    Carrasco, G
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1999, 38 (01) : 79 - 93
  • [2] Sample-path optimality and variance-minimization of average cost Markov control processes
    Hernández-Lerma, Onésimo
    Vega-Amaya, Oscar
    Carrasco, Guadalupe
    SIAM Journal on Control and Optimization, 38 (01): : 79 - 93
  • [3] Average sample-path optimality for continuous-time Markov decision processes in Polish spaces
    Zhu, Quan-xin
    ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2011, 27 (04): : 613 - 624
  • [4] Average sample-path optimality for continuous-time Markov decision processes in Polish spaces
    Quan-xin Zhu
    Acta Mathematicae Applicatae Sinica, English Series, 2011, 27 : 613 - 624
  • [5] Sample-path and variance minimization of Markov control processes with average cost criteria
    Hernández-Lerma, O
    Vega-Amaya, O
    Carrasco, G
    PROCEEDINGS OF THE 39TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2000, : 1172 - 1176
  • [6] Sample-path optimality and variance-maximization for Markov decision processes
    Q. X. Zhu
    Mathematical Methods of Operations Research, 2007, 65 : 519 - 538
  • [7] Sample-path optimality and variance-maximization for Markov decision processes
    Zhu, Q. X.
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2007, 65 (03) : 519 - 538
  • [8] A Counterexample on Sample-Path Optimality in Stable Markov Decision Chains with the Average Reward Criterion
    Rolando Cavazos-Cadena
    Raúl Montes-de-Oca
    Karel Sladký
    Journal of Optimization Theory and Applications, 2014, 163 : 674 - 684
  • [9] A Counterexample on Sample-Path Optimality in Stable Markov Decision Chains with the Average Reward Criterion
    Cavazos-Cadena, Rolando
    Montes-de-Oca, Raul
    Sladky, Karel
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2014, 163 (02) : 674 - 684
  • [10] Another set of conditions for Markov decision processes with average sample-path costs
    Zhu, Quanxin
    Guo, Xianping
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2006, 322 (02) : 1199 - 1214