共 7 条
- [2] Bertsekas D. P., 1997, HEURISTICS, V3, P245
- [3] Bertsekas D. P., 1996, Neuro Dynamic Programming, V1st
- [4] Ross SM., 2014, Introduction to stochastic dynamic programming
- [5] Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447
- [6] TESAURO G, 1996, 1996 NEUR INF PROC S
- [7] Whittle P., 1982, Dynamic Programming and Stochastic Control, V1