Sample-path optimality and variance-minimization of average cost Markov control processes

被引:47
|
作者
Hernández-Lerma, O
Vega-Amaya, O
Carrasco, G
机构
[1] Inst Politecn Nacl, Ctr Invest & Estudios Avanzados, Dept Matemat, Mexico City 07000, DF, Mexico
[2] Sonoma State Univ, Dept Matemat, Hermosillo, Sonora, Mexico
[3] Univ Nacl Autonoma Mexico, Fac Ciencia, Dept Matemat, Mexico City 04510, DF, Mexico
关键词
(discrete-time) Markov control processes; average cost criteria; sample-path average cost; expected average cost; canonical policies; average variance;
D O I
10.1137/S0363012998340673
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies several average-cost criteria for Markov control processes on Borel spaces with possibly unbounded costs. Under suitable hypotheses we show (i) the existence of a sample-path average cost (SPAC-) optimal stationary policy; (ii) a stationary policy is SPAC-optimal if and only if it is expected average cost (EAC-) optimal; and (iii) within the class of stationary SPAC-optimal (equivalently, EAC-optimal) policies there exists one with a minimal limiting average variance.
引用
收藏
页码:79 / 93
页数:15
相关论文
共 50 条
  • [1] Sample-path optimality and variance-minimization of average cost Markov control processes
    Hernández-Lerma, Onésimo
    Vega-Amaya, Oscar
    Carrasco, Guadalupe
    SIAM Journal on Control and Optimization, 38 (01): : 79 - 93
  • [2] Sample-path and variance minimization of Markov control processes with average cost criteria
    Hernández-Lerma, O
    Vega-Amaya, O
    Carrasco, G
    PROCEEDINGS OF THE 39TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2000, : 1172 - 1176
  • [3] Sample-path average optimality for Markov control processes
    Lasserre, JB
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1999, 44 (10) : 1966 - 1971
  • [4] Sample-path optimality and variance-maximization for Markov decision processes
    Q. X. Zhu
    Mathematical Methods of Operations Research, 2007, 65 : 519 - 538
  • [5] Sample-path optimality and variance-maximization for Markov decision processes
    Zhu, Q. X.
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2007, 65 (03) : 519 - 538
  • [6] Variance-minimization of Markov control processes with pathwise constraints
    Mendoza-Perez, Armando F.
    Hernandez-Lerma, Onesimo
    OPTIMIZATION, 2012, 61 (12) : 1427 - 1447
  • [7] Average sample-path optimality for continuous-time Markov decision processes in Polish spaces
    Zhu, Quan-xin
    ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2011, 27 (04): : 613 - 624
  • [8] Average sample-path optimality for continuous-time Markov decision processes in Polish spaces
    Quan-xin Zhu
    Acta Mathematicae Applicatae Sinica, English Series, 2011, 27 : 613 - 624
  • [9] A Sensitivity-Based Construction Approach to Sample-Path Variance Minimization of Markov Decision Processes
    Huang, Yonghao
    Chen, Xi
    2012 2ND AUSTRALIAN CONTROL CONFERENCE (AUCC), 2012, : 215 - 220
  • [10] A Counterexample on Sample-Path Optimality in Stable Markov Decision Chains with the Average Reward Criterion
    Rolando Cavazos-Cadena
    Raúl Montes-de-Oca
    Karel Sladký
    Journal of Optimization Theory and Applications, 2014, 163 : 674 - 684