Constrained average cost Markov control processes in Borel spaces

被引:44
|
作者
Hernández-Lerma, O
González-Hernández, J
López-Martínez, RR
机构
[1] Inst Politecn Nacl, Ctr Invest & Estudios Avanzados, Dept Matemat, Mexico City 07000, DF, Mexico
[2] Univ Nacl Autonoma Mexico, Dept Probabil & Estadist, IIMAS, Mexico City 01000, DF, Mexico
[3] Univ Veracruzana, Fac Matemat, Xalapa 91090, Veracruz, Mexico
关键词
constrained Markov control processes; average cost; discounted cost;
D O I
10.1137/S0363012999361627
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper considers constrained Markov control processes in Borel spaces, with unbounded costs. The criterion to be minimized is a long-run expected average cost, and the constraints can be imposed on similar average costs, or on average rewards, or discounted costs or rewards. We give conditions under which the constrained problem (CP) is solvable and equivalent to an equality constrained (EC) linear program. Furthermore, we show that there is no duality gap between EC and the dual program EC* and that in fact the strong duality condition holds. Finally, we introduce an explicit procedure to solve CP in some cases which is illustrated with a detailed example.
引用
收藏
页码:442 / 468
页数:27
相关论文
共 50 条
  • [1] Value iteration in average cost Markov control processes on borel spaces
    MontesdeOca, R
    HernandezLerma, O
    ACTA APPLICANDAE MATHEMATICAE, 1996, 42 (02) : 203 - 222
  • [2] Policy iteration for average cost Markov control processes on Borel spaces
    HernandezLerma, O
    Lasserre, JB
    ACTA APPLICANDAE MATHEMATICAE, 1997, 47 (02) : 125 - 154
  • [3] Policy Iteration for Average Cost Markov Control Processes on Borel Spaces
    Onésimo Hernández-Lerma
    Jean B. Lasserre
    Acta Applicandae Mathematica, 1997, 47 : 125 - 154
  • [4] THE AVERAGE COST OPTIMALITY EQUATION FOR MARKOV CONTROL PROCESSES ON BOREL SPACES
    MONTESDEOCA, R
    SYSTEMS & CONTROL LETTERS, 1994, 22 (05) : 351 - 357
  • [5] Value Iteration for Average Cost Markov Decision Processes in Borel Spaces
    Zhu, Quanxin
    Guo, Xianping
    APPLIED MATHEMATICS RESEARCH EXPRESS, 2005, (02) : 61 - 76
  • [6] Constrained Markov control processes in Borel spaces:: the discounted case
    Hernández-Lerma, O
    González-Hernández, J
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2000, 52 (02) : 271 - 285
  • [7] Constrained Markov control processes in Borel spaces: the discounted case
    Onésimo Hernández-Lerma
    Juan González-Hernández
    Mathematical Methods of Operations Research, 2000, 52 : 271 - 285
  • [8] Constrained Markov decision processes in Borel spaces: from discounted to average optimality
    Mendoza-Perez, Armando F.
    Jasso-Fuentes, Hector
    De-la-Cruz Courtois, Omar A.
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2016, 84 (03) : 489 - 525
  • [9] Constrained Markov decision processes in Borel spaces: from discounted to average optimality
    Armando F. Mendoza-Pérez
    Héctor Jasso-Fuentes
    Omar A. De-la-Cruz Courtois
    Mathematical Methods of Operations Research, 2016, 84 : 489 - 525
  • [10] Approximation of Constrained Average Cost Markov Control Processes
    Sutter, Tobias
    Esfahani, Peyman Mohajerin
    Lygeros, John
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 6597 - 6602