共 5 条
[1]
A time aggregation approach to Markov decision processes
[J].
AUTOMATICA,
2002, 38 (06)
:929-943
[3]
Puterman M.L., 2008, Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley Series in Probability and Statistics
[5]
Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447