13 entries in total
- [1] Altaf Meteb M., 2015, International Journal of Machine Learning and Computing, V5, P235, DOI 10.7763/IJMLC.2015.V5.513
- [2] [Anonymous], arXiv:1803.06773
- [3] [Anonymous], 2017, ARXIV PREPRINT ARXIV
- [4] Fortunato M., 2017, Noisy networks for exploration
- [5] Mnih V., 2013, P NEURIPS DEEP LEARN
- [6] Human-level control through deep reinforcement learning [J]. NATURE, 2015, 518 (7540) : 529 - 533
- [7] Pandey A., 2017, Int. Robotics Autom. J., V2, P96, DOI 10.15406/IRATJ.2017.02.00023
- [8] Shixiang Gu, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P3389, DOI 10.1109/ICRA.2017.7989385
- [10] Van Hasselt H, 2016, AAAI, V2, P5