共 13 条
[2]
Celiberto LA, 2008, LECT NOTES COMPUT SC, V5001, P220
[3]
Ernst D, 2005, J MACH LEARN RES, V6, P503
[4]
Hunt J.J., CONTINUOUS CONTROL D
[5]
Mnih V., PLAYING ATARI DEEP R
[7]
Riedmiller M, 2005, LECT NOTES ARTIF INT, V3720, P317, DOI 10.1007/11564096_32
[8]
Silver D., 2014, ICML ICML 14, P387
[9]
Sutton RS, 1996, ADV NEUR IN, V8, P1038
[10]
Sutton RS, 2000, ADV NEUR IN, V12, P1057