28 entries in total
- [1] Afantenos Stergos D., 2015, P 2015 C EMP METH NA, P928
- [2] Anschel O., 2016, arXiv
- [3] Bellman R, 2013, DYNAMIC PROGRAMMING
- [4] Cuayahuitl H., 2015, P NIPS DEEP REINFORC
- [5] Dearden R, 1998, FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, P761
- [6] Dobre MS, 2015, IEEE CONF COMPU INTE, P60, DOI 10.1109/CIG.2015.7317942
- [7] Finnman P, 2016, Deep reinforcement learning compared with Q-table learning applied to backgammon
- [8] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
- [9] Guhe Markus, 2014, 2014 IEEE Conference on Computational Intelligence and Games, P1
- [10] Hausknecht M, 2015, 2015 AAAI FALL S SER, P29, DOI 10.1.1.696.1421