Markov Decision Processes with Exogenous Variables

Cited by: 4

Authors:

Bray, Robert L. [1]

Affiliation:

[1] Northwestern Univ, Kellogg Sch Management, Evanston, IL 60208 USA

Source:

MANAGEMENT SCIENCE | 2019 / Vol. 65 / No. 10

Keywords:

Markov decision process; dynamic programming; endogenous value iteration; relative value iteration; exogenous variables; convergence

DOI:

10.1287/mnsc.2018.3158

Chinese Library Classification (CLC) number:

C93 [Management Science]

Discipline classification codes:

12; 1201; 1202; 120202

Abstract:

I present two algorithms for solving dynamic programs with exogenous variables: endogenous value iteration and endogenous policy iteration. These algorithms are always at least as fast as relative value iteration and relative policy iteration, and they are faster when the endogenous variables converge to their stationary distributions sooner than the exogenous variables.
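For orientation, the baseline named in the abstract, relative value iteration, can be sketched as follows. This is a generic textbook-style version for a small discounted, finite MDP, not the paper's endogenous value iteration, whose details are not given in this record. The function name, the array layout (`P` as actions × states × states, `r` as actions × states), the choice of state 0 as the reference state, and the toy numbers are all assumptions made for illustration.

```python
import numpy as np

def relative_value_iteration(P, r, beta, tol=1e-10, max_iter=10_000):
    """Hypothetical helper: relative value iteration for a finite discounted MDP.

    P    : array (A, S, S), P[a, s, t] = Pr(next state t | state s, action a)
    r    : array (A, S), one-period reward for action a in state s
    beta : discount factor in (0, 1)
    Returns the relative value function (normalized so v[0] == 0),
    a greedy policy, and the number of iterations used.
    """
    n_states = P.shape[1]
    v = np.zeros(n_states)
    for it in range(max_iter):
        q = r + beta * (P @ v)        # action values, shape (A, S)
        tv = q.max(axis=0)            # Bellman update
        diff = tv - v
        span = diff.max() - diff.min()  # span seminorm of the update
        v = tv - tv[0]                # subtract reference-state value (assumed: state 0)
        if span < tol:                # stop once values differ only by a constant shift
            break
    policy = q.argmax(axis=0)         # greedy policy is unaffected by constant shifts
    return v, policy, it + 1

# Toy two-state, two-action problem (made-up numbers, for illustration only).
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.5, 0.5], [0.6, 0.4]]])
r = np.array([[1.0, 0.0],
              [0.5, 0.8]])
beta = 0.95

v_rel, policy, n_iter = relative_value_iteration(P, r, beta)
```

The stopping rule monitors the span of successive differences rather than their sup norm, which is why the iteration can stop once the value function has settled up to an additive constant; the greedy policy depends only on those relative values.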

Pages: 4598-4606

Number of pages: 9
