共 42 条
- [1] [Anonymous], 2017, OpenAI Baselines
- [2] [Anonymous], 2016, CORR
- [3] [Anonymous], 2014, ICML ICML 14
- [4] [Anonymous], 2017, CoRR
- [5] [Anonymous], 1998, REINFORCEMENT LEARNI
- [6] [Anonymous], ADV NEURAL INFORM PR
- [7] Anschel O, 2017, 34 INT C MACHINE LEA, V70
- [8] Barth-Maron G., 2018, P INT C LEARN REPR
- [9] Bellemare MG, 2017, PR MACH LEARN RES, V70
- [10] Bellman R. E., 1957, Dynamic programming. Princeton landmarks in mathematics