Stochastic shortest path problems with associative accumulative criteria

被引:5
作者
Ohtsubo, Yoshio [1 ]
机构
[1] Kochi Univ, Fac Sci, Dept Math, Kochi 7808520, Japan
关键词
shortest path problem; Markov decision process; associative criterion; invariant imbedding method; optimality equation; existence of optimal policy;
D O I
10.1016/j.amc.2007.08.025
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We consider a stochastic shortest path problem with associative criteria in which for each node of a graph we choose a probability distribution over the set of successor nodes so as to reach a given target node optimally. We formulate such a problem as an associative Markov decision processes. We show that an optimal value function is a unique solution to an optimality equation and find an optimal stationary policy. Also we give a value iteration method and a policy improvement method. (C) 2007 Elsevier Inc. All rights reserved.
引用
收藏
页码:198 / 208
页数:11
相关论文
共 20 条
[1]  
[Anonymous], J FLUIDS ENG
[2]  
BELLMAN RE, 1970, MANAGE SCI B-APPL, V17, pB141
[3]   AN ANALYSIS OF STOCHASTIC SHORTEST-PATH PROBLEMS [J].
BERTSEKAS, DP ;
TSITSIKLIS, JN .
MATHEMATICS OF OPERATIONS RESEARCH, 1991, 16 (03) :580-595
[4]   ON SEQUENTIAL DECISIONS AND MARKOV-CHAINS [J].
DERMAN, C .
MANAGEMENT SCIENCE, 1962, 9 (01) :16-24
[5]  
DERMAN C, 1970, FINITE STATE MARKOVI
[6]   Conditional decision-making in fuzzy environment [J].
Iwamoto, S ;
Tsurusaki, K ;
Fujita, T .
JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF JAPAN, 1999, 42 (02) :198-218
[7]   Stochastic decision-making in a fuzzy environment [J].
Iwamoto, S ;
Fujita, T .
JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF JAPAN, 1995, 38 (04) :467-482
[8]   On Markov policies for minimax decision processes [J].
Iwamoto, S ;
Tsurusaki, K .
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2001, 253 (01) :58-78
[9]  
Maruyama Y., 1999, Bulletin of Informatics and Cybernetics, V31, P147
[10]  
MARUYAMA Y, 1999, MATH JPN, V50, P469