On mean reward variance in semi-Markov processes

被引:0
作者
Karel Sladký
机构
[1] Academy of Sciences of the Czech Republic,Institute of Information Theory and Automation
来源
Mathematical Methods of Operations Research | 2005年 / 62卷
关键词
Markov and semi-Markov processes with rewards; Variance of cumulative reward; Asymptotic behaviour; Primary 90C47; Secondary 60J27;
D O I
暂无
中图分类号
学科分类号
摘要
As an extension of the discrete-time case, this note investigates the variance of the total cumulative reward for the embedded Markov chain of semi-Markov processes. Under the assumption that the chain is aperiodic and contains a single class of recurrent states recursive formulae for the variance are obtained which show that the variance growth rate is asymptotically linear in time. Expressions are provided to compute this growth rate.
引用
收藏
页码:387 / 397
页数:10
相关论文
共 16 条
  • [11] Kawai H(1982)The variance of discounted Markov decision processes J Appl Probab 19 794-802
  • [12] Kurano M(1985)Maximal mean/standard deviation ratio in an undiscounted MDP Oper Res Lett 4 157-159
  • [13] Mandl P(1988)Mean variance and probability criteria in finite Markov decision processes: a review J Optim Theory Appl 56 1-29
  • [14] Sobel MJ(undefined)undefined undefined undefined undefined-undefined
  • [15] Sobel MJ(undefined)undefined undefined undefined undefined-undefined
  • [16] White DJ(undefined)undefined undefined undefined undefined-undefined