11 entries in total
[1]
Azar M. G., 2011, ADV NEURAL INFORM PR, P2411
[2]
Bellemare MG, 2017, PR MACH LEARN RES, V70
[4]
Even-Dar E, 2003, J MACH LEARN RES, V5, P1
[5]
Ghavamzadeh M., 2011, REINFORCEMENT LEARNI
[6]
Kallenberg O, 2017, PROB THEOR STOCH MOD, V77, P1, DOI 10.1007/978-3-319-41598-7
[7]
Lyle C, 2019, arXiv, arXiv:1901.11084
[8]
Rowland M., 2018, INT C ARTIFICIAL INT, P29
[9]
Sutton R. S., 1988, Machine Learning, V3, P9, DOI 10.1007/BF00115009
[10]
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1