TURNPIKES IN FINITE MARKOV DECISION PROCESSES AND RANDOM WALK

Author
Piunovskiy, A. B. [1]
Affiliation
[1] Univ Liverpool, Dept Math Sci, Liverpool, England
Keywords
Markov decision process; discounted reward; average reward; random walk; stochastic knapsack problem; turnpike;
DOI
10.1137/S0040585X97T991325
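Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Subject classification code
020208 ; 070103 ; 0714 ;
Abstract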
In this paper we revise the theory of turnpikes in discounted Markov decision processes, prove the turnpike theorem for the undiscounted model, and apply the results to a specific random walk.
Pages: 123-149
Number of pages: 27