共 45 条
[1]
Achiam J., 2018, ARXIV180710299
[2]
Agarwal Rishabh., 2021, Deep reinforcement learning at the edge of the statistical precipice
[3]
[Anonymous], 2018, INT C MACH LEARN
[4]
Barber D., 2003, NIPS
[5]
Barreto A., 2016, ARXIV160605312
[6]
Beirlant Jan., 1997, INT J MATH STAT SCI, V6, P17
[7]
Brockman G., 2016, OPENAI GYM, P1
[8]
Burda Y., 2019, INT C LEARN REPR ICL
[9]
Caron M, 2020, ADV NEUR IN, V33
[10]
Chen Ting, 2020, INT C MACHINE LEARNI