共 40 条
[11]
Bakker B, 2010, STUD COMPUT INTELL, V281, P475
[15]
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[16]
Busoniu L, 2010, STUD COMPUT INTELL, V310, P183
[19]
Dulac-Arnold Gabriel, 2015, Deep reinforcement learning in large discrete action spaces