TURNPIKES IN FINITE MARKOV DECISION PROCESSES AND RANDOM WALK*

Cited by: 0

Author:

Piunovskiy, A. B. [1]

Affiliation:

[1] Univ Liverpool, Dept Math Sci, Liverpool, England

Source:

THEORY OF PROBABILITY AND ITS APPLICATIONS | 2023 / Vol. 68 / No. 01

Keywords:

Markov decision process; discounted reward; average reward; random walk; stochastic knapsack problem; turnpike;

DOI:

10.1137/S0040585X97T991325

Chinese Library Classification (CLC) number:

O21 [Probability theory and mathematical statistics]; C8 [Statistics];

Subject classification codes:

020208 ; 070103 ; 0714 ;

Abstract:

In this paper we revise the theory of turnpikes in discounted Markov decision processes, prove the turnpike theorem for the undiscounted model, and apply the results to the specific random walk.

Pages: 123-149

Page count: 27
