Singular Control for Discounted Markov Decision Processes in a General State Space

被引：0

作者：

Costa, O. L. V. ^{[1
]}

Dufour, F. ^{[2
]}

机构：

[1] Univ Sao Paulo, Dept Engn Telecomunicacoes & Controle, Escola Politecn, BR-05508900 Sao Paulo, Brazil

[2] Univ Bordeaux 1, MAB, F-33405 Talence, France

来源：

2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC) | 2011年

关键词：

DISCRETE-TIME;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper studies the asymptotic optimality of discrete-time Markov Decision Processes (MDP's in short) with general state space and action space and having weak and strong interactions. By using a similar approach as developed in [1], the idea in this paper is to consider a MDP with general state and action spaces and to reduce the dimension of the state space by considering an averaged model. This formulation is often described by introducing a small parameter epsilon > 0 in the definition of the transition kernel, leading to a singularly perturbed Markov model with two time scales. First it is shown that the value function of the control problem for the perturbed system converges to the value function of a limit averaged control problem as epsilon goes to zero. In the sequel it is shown that a feedback control policy for the original control problem defined by using an optimal feedback policy for the limit problem is asymptotically optimal.

引用

页码：7087 / 7092

页数：6

共 13 条

[1] PERTURBATION AND STABILITY THEORY FOR MARKOV CONTROL-PROBLEMS [J].