共 22 条
[1]
Bertsekas DP, 2012, DYNAMIC PROGRAMMING, V2
[3]
Basic ideas for event-based optimization of Markov systems
[J].
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS,
2005, 15 (02)
:169-197
[7]
From perturbation analysis to Markov decision processes and reinforcement learning
[J].
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS,
2003, 13 (1-2)
:9-39
[9]
CHUNG KL, 1960, MARKOV CHAINS WITH S
[10]
Feinberg EA, 2002, HDB MARKOV DECISION