共 35 条
- [11] Dayan Peter, 1993, Advances in neural information processing systems, P271
- [12] Dietterich Thomas G., 2000, J ARTIF INTELL RES J
- [13] Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
- [14] Jaderberg Max, 2016, ARXIV161105397
- [15] Kaelbling Leslie Pack, 2014, ICML
- [16] Kulkarni TD., 2016, ADV NEURAL INFORM PR, P3682
- [18] Lillicrap T. P., 2015, 4 INT C LEARN REPR I, DOI [10.48550/arXiv.1509.02971, DOI 10.48550/ARXIV.1509.02971]
- [19] Mnih V, 2016, PR MACH LEARN RES, V48
- [20] Human-level control through deep reinforcement learning [J]. NATURE, 2015, 518 (7540) : 529 - 533