Considerable numerical experience indicates that the standard value iteration procedure for infinite horizon discounted Markov decision processes performs much better than the usual error bound analysis suggests. This paper attempts to examine why this happens and introduces an additional pointwise convergence concept to that of the usual maximum norm concept, in order to examine why some states exhibit better convergence behaviour than others. We also present some numerical results. (C) 1994 Academic Press, Inc.
机构:
TEL AVIV UNIV,RAYMOND & BEVERLY SACKLER FAC EXACT SCI,DEPT STAT,IL-69978 TEL AVIV,ISRAELTEL AVIV UNIV,RAYMOND & BEVERLY SACKLER FAC EXACT SCI,DEPT STAT,IL-69978 TEL AVIV,ISRAEL
HERZBERG, M
YECHIALI, U
论文数: 0引用数: 0
h-index: 0
机构:
TEL AVIV UNIV,RAYMOND & BEVERLY SACKLER FAC EXACT SCI,DEPT STAT,IL-69978 TEL AVIV,ISRAELTEL AVIV UNIV,RAYMOND & BEVERLY SACKLER FAC EXACT SCI,DEPT STAT,IL-69978 TEL AVIV,ISRAEL