A further remark on dynamic programming for partially observed Markov processes

Cited by: 16
Authors
Borkar, VS
Budhiraja, A
Affiliations
[1] Tata Inst Fundamental Res, Sch Technol & Comp Sci, Bombay 400005, Maharashtra, India
[2] Univ N Carolina, Dept Stat, Chapel Hill, NC 27599 USA
Keywords
controlled Markov processes; dynamic programming; partial observations; ergodic cost; vanishing discount; pseudo-atom
DOI
10.1016/j.spa.2004.01.011
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
In (Stochastic Process. Appl. 103 (2003) 293), a pair of dynamic programming inequalities was derived for the 'separated' ergodic control problem for partially observed Markov processes, using the 'vanishing discount' argument. In this note, we strengthen these results to derive a single dynamic programming equation for the same problem. © 2004 Elsevier B.V. All rights reserved.
Pages: 79-93
Page count: 15
Related papers
14 in total
[1] [Anonymous], 1991, Pitman Res Notes Mat
[2] Borkar, VS, 1989, Pitman Research Notes in Math. Series, V203
[3] White-noise representations in stochastic-realization theory [J]. Borkar, VS. SIAM Journal on Control and Optimization, 1993, 31 (05): 1093-1102
[4] Average cost dynamic programming equations for controlled Markov chains with partial observations [J]. Borkar, VS. SIAM Journal on Control and Optimization, 2000, 39 (03): 673-681
[5] Dynamic programming for ergodic control with partial observations [J]. Borkar, VS. Stochastic Processes and their Applications, 2003, 103 (02): 293-310
[6] Remarks on the existence of solutions to the average cost optimality equation in Markov decision processes [J]. Fernandez-Gaucherand, E; Arapostathis, A; Marcus, SI. Systems & Control Letters, 1990, 15 (05): 425-432
[7] Optimal control for partially observed diffusions [J]. Fleming, WH; Pardoux, E. SIAM Journal on Control and Optimization, 1982, 20 (02): 261-285
[8] Meyn, SP, 1993, Stochastic Stability of Markov chains
[9] Optimal infinite-horizon undiscounted control of finite probabilistic systems [J]. Platzman, LK. SIAM Journal on Control and Optimization, 1980, 18 (04): 362-380
[10] Runggaldier, WJ, 1994, Appl Maths Monograph, V6