Markov Decision Processes with Exogenous Variables

被引:4
作者
Bray, Robert L. [1 ]
机构
[1] Northwestern Univ, Kellogg Sch Management, Evanston, IL 60208 USA
关键词
Markov decision process; dynamic programming; endogenous value iteration; relative value iteration; exogenous variables; CONVERGENCE;
D O I
10.1287/mnsc.2018.3158
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
I present two algorithms for solving dynamic programs with exogenous variables: endogenous value iteration and endogenous policy iteration. These algorithms are always at least as fast as relative value iteration and relative policy iteration, and they are faster when the endogenous variables converge to their stationary distributions sooner than the exogenous variables.
引用
收藏
页码:4598 / 4606
页数:9
相关论文
共 15 条
[1]   Optimality of Quasi-Open-Loop Policies for Discounted Semi-Markov Decision Processes [J].
Adelman, Daniel ;
Mancini, Angelo J. .
MATHEMATICS OF OPERATIONS RESEARCH, 2016, 41 (04) :1222-1247
[2]  
Aguirregabiria V, 2013, ADV ECONOMETRICS, V31, P3, DOI 10.1108/S0731-9053(2013)0000032001
[3]   Dynamic discrete choice structural models: A survey [J].
Aguirregabiria, Victor ;
Mira, Pedro .
JOURNAL OF ECONOMETRICS, 2010, 156 (01) :38-67
[4]  
[Anonymous], 2013, Matrix Analysis
[5]  
[Anonymous], 2018, WORKING PAPER
[6]  
Bray RL, 2019, QUANT EC
[7]  
Chen C.-H., 2017, WORKING PAPER
[8]  
Hajnal J, 1957, MATH P CAMBRIDGE PHI, V54, P233
[9]   DETERMINISTIC EQUIVALENCE IN STOCHASTIC INFINITE HORIZON PROBLEMS [J].
HIGLE, JL ;
BEAN, JC ;
SMITH, RL .
MATHEMATICS OF OPERATIONS RESEARCH, 1990, 15 (03) :396-407
[10]   ASYMPTOTIC CONVERGENCE RATE OF COST DIFFERENCES FOR MARKOVIAN DECISION PROCESSES [J].
MORTON, TE .
OPERATIONS RESEARCH, 1971, 19 (01) :244-&