Optimal threshold probability in undiscounted Markov decision processes with a target set

Cited by: 16

Authors:

Ohtsubo, Y [1]

Affiliations:

[1] Kochi Univ, Fac Sci, Dept Math & Informat Sci, Kochi 7808520, Japan

Source:

APPLIED MATHEMATICS AND COMPUTATION | 2004 / Volume 149 / Issue 02

Keywords:

Markov decision process; minimizing risk model; existence of optimal policy; value iteration; policy improvement method;

DOI:

10.1016/S0096-3003(03)00158-9

Chinese Library Classification number:

O29 [Applied Mathematics];

Subject classification code:

070104 ;

Abstract:

We consider risk minimizing problems in undiscounted Markov decision processes with a target set. We formulate the problem as an infinite horizon case with a recurrent class. We show that an optimal value function is a unique solution to an optimality equation and that there exists a stationary optimal policy. We also give several value iteration methods and a policy improvement method. (C) 2003 Elsevier Inc. All rights reserved.
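As a rough illustration of what an optimality equation for a threshold probability with a target set can look like, the sketch below solves a toy problem by recursion on the remaining budget. It is not the paper's formulation or algorithm. The toy MDP, the names `TRANSITIONS`, `TARGET`, and `min_exceed_prob`, the choice of event (total cost before reaching the target exceeds a budget), and the assumption of strictly positive integer costs are all hypothetical and chosen only to keep the recursion finite.

```python
from functools import lru_cache

# Hypothetical toy MDP (an assumption for illustration, not taken from the paper).
# TRANSITIONS[state][action] = (cost, [(next_state, probability), ...])
# Costs are strictly positive integers, so the remaining budget strictly decreases.
TARGET = "T"
TRANSITIONS = {
    0: {
        "safe": (2, [(TARGET, 1.0)]),
        "risky": (1, [(TARGET, 0.5), (1, 0.5)]),
    },
    1: {
        "go": (1, [(TARGET, 1.0)]),
    },
}


@lru_cache(maxsize=None)
def min_exceed_prob(state, budget):
    """Minimal probability that the total cost paid before reaching TARGET exceeds `budget`."""
    if budget < 0:
        return 1.0  # the threshold has already been exceeded
    if state == TARGET:
        return 0.0  # target reached within the budget
    best = 1.0
    for cost, successors in TRANSITIONS[state].values():
        # Expected value of the continuation after paying `cost` for this action.
        value = sum(p * min_exceed_prob(nxt, budget - cost) for nxt, p in successors)
        best = min(best, value)
    return best


if __name__ == "__main__":
    for budget in (0, 1, 2):
        print(budget, min_exceed_prob(0, budget))
```

The state is augmented with the remaining budget, which is the usual device for threshold-probability criteria: the best action can depend on how much of the threshold is left, so a policy that is stationary in this sense is stationary on the augmented state.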


Pages: 519 - 532

Number of pages: 14

