共 34 条
- [7] Hinton G. E., 1993, Advances in Neural Information Processing Systems, P3, DOI [DOI 10.5555/2987189.2987190, 10.5555/2987189.2987190]
- [9] Bandit based Monte-Carlo planning [J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 282 - 293