Singular Control for Discounted Markov Decision Processes in a General State Space

被引:0
作者
Costa, O. L. V. [1 ]
Dufour, F. [2 ]
机构
[1] Univ Sao Paulo, Dept Engn Telecomunicacoes & Controle, Escola Politecn, BR-05508900 Sao Paulo, Brazil
[2] Univ Bordeaux 1, MAB, F-33405 Talence, France
来源
2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC) | 2011年
关键词
DISCRETE-TIME;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the asymptotic optimality of discrete-time Markov Decision Processes (MDP's in short) with general state space and action space and having weak and strong interactions. By using a similar approach as developed in [1], the idea in this paper is to consider a MDP with general state and action spaces and to reduce the dimension of the state space by considering an averaged model. This formulation is often described by introducing a small parameter epsilon > 0 in the definition of the transition kernel, leading to a singularly perturbed Markov model with two time scales. First it is shown that the value function of the control problem for the perturbed system converges to the value function of a limit averaged control problem as epsilon goes to zero. In the sequel it is shown that a feedback control policy for the original control problem defined by using an optimal feedback policy for the limit problem is asymptotically optimal.
引用
收藏
页码:7087 / 7092
页数:6
相关论文
共 13 条
[1]   PERTURBATION AND STABILITY THEORY FOR MARKOV CONTROL-PROBLEMS [J].
ABBAD, M ;
FILAR, JA .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1992, 37 (09) :1415-1420
[2]   ALGORITHMS FOR SINGULARLY PERTURBED LIMITING AVERAGE MARKOV CONTROL-PROBLEMS [J].
ABBAD, M ;
FILAR, JA ;
BIELECKI, TR .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1992, 37 (09) :1421-1425
[3]  
[Anonymous], APPL MATH
[4]  
Avrachenkov K.E., 2002, INT SER OPER RES MAN, P113, DOI 10.1007/978-1-4615-0805-2
[5]   SINGULARLY PERTURBED MARKOV CONTROL PROBLEM: LIMITING AVERAGE COST [J].
Bielecki, Tomasz R. ;
Filar, Jerzy A. .
ANNALS OF OPERATIONS RESEARCH, 1991, 28 (01) :153-168
[6]   Ergodic control of a singularly perturbed Markov process in discrete time with general state and compact action spaces [J].
Bielecki, TR ;
Stettner, L .
APPLIED MATHEMATICS AND OPTIMIZATION, 1998, 38 (03) :261-281
[7]  
Costa O., 2011, SINGULAR DISCO UNPUB, P21
[8]  
Gordienko E., 1995, APPL MATH, V23
[9]  
Gordienko E., 1995, APPL MATH, V23
[10]  
Hernandez-Lerma O., 1996, APPL MATH, V30