Average cost dynamic programming equations for controlled Markov chains with partial observations
Cited by: 43
Authors:
Borkar, VS [1]
Affiliations:
[1] Tata Inst Fundamental Res, Sch Technol & Comp Sci, Mumbai 400005, India
Keywords:
average cost control;
controlled Markov chains;
partial observations;
dynamic programming;
vanishing discount limit;
DOI:
10.1137/S0363012998345172
Chinese Library Classification (CLC) number:
TP [Automation technology, computer technology];
Subject classification code:
0812 ;
Abstract:
The value function for the average cost control of a class of partially observed Markov chains is derived as the vanishing discount limit, in a suitable sense, of the value functions for the corresponding discounted cost problems. The limiting procedure is justified by bounds derived using a simple coupling argument.
Pages: 673-681
Number of pages: 9