14 entries in total
[2]
BORKAR VK, 1984, SIAM J CONTROL OPTIM, V21, P965
[7]
HERNANDEZLERMA O, 1988, ADAPTIVE MARKOV CONT
[8]
HINDERER K, 1970, LECT NOTES OPERATION, V33
[9]
Loeve M., 1977, Probability Theory, Vi
[10]
Puterman M.L., 2008, Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley Series in Probability and Statistics