[2]
Bertsekas D. P., 1997, J. Heuristics, V3, P245
[3]
Bertsekas D. P., 1996, Neuro-Dynamic Programming, 1st ed.
[4]
Ross S. M., 2014, Introduction to Stochastic Dynamic Programming
[5]
Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447
[6]
Tesauro G., 1996, 1996 NEUR INF PROC S
[7]
Whittle P., 1982, Dynamic Programming and Stochastic Control, V1