共 23 条
[1]
[Anonymous], 1989, LEARNING DELAYED REW
[2]
[Anonymous], 1998, REINFORCEMENT LEARNI
[4]
Bellman R. E., 1957, Dynamic programming. Princeton landmarks in mathematics
[5]
Bertsekas D.P., 1996, Athena Scientific, V7, P15
[6]
Brockmeier A. J., 2014, NEURAL COMPUT
[7]
Cobo L.C., 2011, IJCAI Proceedings-International Joint Conference on Artificial Intelligence, V22, P1243
[8]
Cortes C, 2012, J MACH LEARN RES, V13, P795
[9]
Fukumizu K, 2004, J MACH LEARN RES, V5, P73
[10]
Gabel T, 2005, LECT NOTES ARTIF INT, V3620, P206, DOI 10.1007/11536406_18