Sample-path optimality and variance-minimization of average cost Markov control processes

被引:47
作者
Hernández-Lerma, O
Vega-Amaya, O
Carrasco, G
机构
[1] Inst Politecn Nacl, Ctr Invest & Estudios Avanzados, Dept Matemat, Mexico City 07000, DF, Mexico
[2] Sonoma State Univ, Dept Matemat, Hermosillo, Sonora, Mexico
[3] Univ Nacl Autonoma Mexico, Fac Ciencia, Dept Matemat, Mexico City 04510, DF, Mexico
关键词
(discrete-time) Markov control processes; average cost criteria; sample-path average cost; expected average cost; canonical policies; average variance;
D O I
10.1137/S0363012998340673
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies several average-cost criteria for Markov control processes on Borel spaces with possibly unbounded costs. Under suitable hypotheses we show (i) the existence of a sample-path average cost (SPAC-) optimal stationary policy; (ii) a stationary policy is SPAC-optimal if and only if it is expected average cost (EAC-) optimal; and (iii) within the class of stationary SPAC-optimal (equivalently, EAC-optimal) policies there exists one with a minimal limiting average variance.
引用
收藏
页码:79 / 93
页数:15
相关论文
共 37 条
  • [1] [Anonymous], STOCHASTIC DYNAMIC P
  • [2] [Anonymous], 1992, Stochastic Stability of Markov chains
  • [3] DISCRETE-TIME CONTROLLED MARKOV-PROCESSES WITH AVERAGE COST CRITERION - A SURVEY
    ARAPOSTATHIS, A
    BORKAR, VS
    FERNANDEZGAUCHERAND, E
    GHOSH, MK
    MARCUS, SI
    [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1993, 31 (02) : 282 - 344
  • [4] Bertsekas D. P., 1987, DYNAMIC PROGRAMMING
  • [5] BORKAR VS, 1988, STOCHASTIC DIFFERENT, V10, P57
  • [6] BORKAR VS, 1991, KPITMAN RES NOTES MA, V240
  • [7] Cavazos-Cadena R., 1995, MATH METHOD OPER RES, V41, P89
  • [8] RECURRENCE CONDITIONS FOR AVERAGE AND BLACKWELL OPTIMALITY IN DENUMERABLE STATE MARKOV DECISION CHAINS
    DEKKER, R
    HORDIJK, A
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 1992, 17 (02) : 271 - 289
  • [9] DUFLO M., 1990, Methodes Recursives Aleatoires
  • [10] Dynkin E.B., 1979, Grundlehren der Mathematischen Wissenschaften, V235