共 9 条
[1]
DUFF M, 2002, THESIS U MASSASSACHU
[3]
Poupart P., 2008, P ISAIM, P1025
[4]
Poupart Pascal, 2006, ACM International Conference Proceeding Series, P697
[5]
Ross S, 2011, J MACH LEARN RES, V12, P1729
[6]
Ross Stephane, 2008, Uncertain Artif Intell, V2008, P476
[7]
Sutton R.S., 2017, Introduction to reinforcement learning
[8]
Wang Y., 2012, P ICML, P1135
[9]
Wu B., 2013, RUANJIAN XUEBAO, V24, P25