34 entries in total
[1]
[Anonymous], 2013, Playing Atari with deep reinforcement learning
[2]
[Anonymous], 2011, 2011001 CORE LIDAM U
[3]
[Anonymous], 2014, ICML ICML 14
[4]
[Anonymous], 2009, Advances in Neural Information Processing Systems
[5]
Argyriou A, 2009, J MACH LEARN RES, V10, P2507
[6]
Bertsekas D. P., 1999, NONLINEAR PROGRAMMIN, V2nd
[7]
Deisenroth MP., 2013, FOUND TRENDS ROBOT, V2, P1, DOI 10.1561/2300000021
[8]
Durrett R., 2010, PROBABILITY THEORY E
[10]
Howard R. A., 1964, DYNAMIC PROGRAMMING