23 entries in total
[1]
[Anonymous], 1989, LEARNING DELAYED REW
[2]
[Anonymous], 1995, Optimal Control
[3]
[Anonymous], P 2009 IEEE INT C SY
[4]
Bertsekas D. P., 1996, Neuro-dynamic Programming
[5]
Bradtke SJ, 1994, P 1994 AM CONTR C 19
[10]
Landelius T., 1997, THESIS