共 8 条
[1]
BAIRD L, 1993, WLTR931147
[2]
Bertsekas D.P., 2005, DYNAMIC PROGRAMMING, V1
[3]
Bertsekas D. P., 1996, Neuro Dynamic Programming, V1st
[4]
Bertsekas DP, 1995, Dynamic Programming and Optimal Control, V2
[6]
SUTTON RS, 1997, INTRO REINFORCEMENT
[7]
Terwiesch P., 1994, Journal of Process Control, V4, P238, DOI 10.1016/0959-1524(94)80045-6
[8]
Wilson JA, 1997, COMPUT CHEM ENG, V21, pS1233