Sample-path optimality and variance-minimization of average cost Markov control processes

Cited by: 47

Authors:

Hernández-Lerma, O

Vega-Amaya, O

Carrasco, G

Affiliations:

[1] Inst Politecn Nacl, Ctr Invest & Estudios Avanzados, Dept Matemat, Mexico City 07000, DF, Mexico

[2] Univ Sonora, Dept Matemat, Hermosillo, Sonora, Mexico

[3] Univ Nacl Autonoma Mexico, Fac Ciencia, Dept Matemat, Mexico City 04510, DF, Mexico

Source:

SIAM JOURNAL ON CONTROL AND OPTIMIZATION | 1999 / Vol. 38 / No. 01

Keywords:

(discrete-time) Markov control processes; average cost criteria; sample-path average cost; expected average cost; canonical policies; average variance;

DOI:

10.1137/S0363012998340673

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper studies several average-cost criteria for Markov control processes on Borel spaces with possibly unbounded costs. Under suitable hypotheses we show (i) the existence of a sample-path average cost (SPAC-) optimal stationary policy; (ii) a stationary policy is SPAC-optimal if and only if it is expected average cost (EAC-) optimal; and (iii) within the class of stationary SPAC-optimal (equivalently, EAC-optimal) policies there exists one with a minimal limiting average variance.
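For orientation, the three criteria named in the abstract are commonly formalized as below. This is a sketch using standard notation from the average-cost literature, not a quotation from the paper: the symbols $c$, $x_t$, $a_t$, $\pi$, $E_x^{\pi}$, and $\rho^*$ are assumptions introduced here, and the paper's own definitions (in particular of the limiting average variance) may differ in detail.

```latex
% Assumed notation: c is the one-stage cost, (x_t, a_t) the state-action
% process under policy \pi with initial state x, and \rho^* the optimal
% expected average cost.
\begin{align*}
  % Sample-path average cost (a random variable):
  J_0(\pi, x) &:= \limsup_{n \to \infty} \frac{1}{n} \sum_{t=0}^{n-1} c(x_t, a_t), \\
  % Expected average cost:
  J(\pi, x) &:= \limsup_{n \to \infty} \frac{1}{n}\,
      E_x^{\pi}\!\left[ \sum_{t=0}^{n-1} c(x_t, a_t) \right], \\
  % Limiting average variance, for policies attaining \rho^*:
  V(\pi, x) &:= \limsup_{n \to \infty} \frac{1}{n}\,
      E_x^{\pi}\!\left[ \Bigl( \sum_{t=0}^{n-1} c(x_t, a_t) - n \rho^* \Bigr)^{2} \right].
\end{align*}
```

Under this reading, result (ii) says that a stationary policy minimizes $J_0$ almost surely exactly when it minimizes $J$, and result (iii) selects, among those minimizers, one with the smallest $V$.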

Pages: 79-93

Number of pages: 15
