A COUNTEREXAMPLE ON THE OPTIMALITY EQUATION IN MARKOV DECISION CHAINS WITH THE AVERAGE COST CRITERION

被引:20
作者
CAVAZOSCADENA, R [1 ]
机构
[1] UNIV AUTONOMA AGR ANTONIO NARRO,DEPT ESTADIS & CALCULO,SALTILLO 25315,COAHUILA,MEXICO
关键词
MARKOV DECISION PROCESSES; AVERAGE COST CRITERION; OPTIMAL STATIONARY POLICIES; COUNTEREXAMPLE ON THE AVERAGE COST OPTIMALITY EQUATION;
D O I
10.1016/0167-6911(91)90060-R
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider Markov decision processes with denumerable state space and finite control sets; the performance index of a control policy is a long-run expected average cost criterion and the cost function is bounded below. For these models, the existence of average optimal stationary policies was recently established in [11] under very general assumptions. Such a result was obtained via an optimality inequality. Here, we use a simple example to prove that the conditions in [11] do not imply the existence of a solution to the average cost optimality equation.
引用
收藏
页码:387 / 392
页数:6
相关论文
共 12 条
[1]  
Ash R. B., 2014, REAL ANAL PROBABILIT
[2]   NECESSARY AND SUFFICIENT CONDITIONS FOR A BOUNDED SOLUTION TO THE OPTIMALITY EQUATION IN AVERAGE REWARD MARKOV DECISION CHAINS [J].
CAVAZOSCADENA, R .
SYSTEMS & CONTROL LETTERS, 1988, 10 (01) :71-78
[3]  
CAVAZOSCADENA R, 1989, KYBERNETIKA, V25, P145
[4]  
CAVAZOSCADENA R, 1990, IN PRESS KYBERNETIKA
[5]  
CAVAZOSCADENA R, UNPUB COMP RECENT AS
[6]  
HERNANDEZLERMA O, 1990, ADAPTIVE MARKOV CONT
[7]  
Loeve M., 1977, PROBABILITY THEORY, V1
[8]  
ROSS SM, 1970, APPLIED PROBABILITY