共 15 条
- [1] Finite-time analysis of the multiarmed bandit problem [J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256
- [2] Azar M. G., 2014, INT C MACH LEARN
- [3] Bubeck S., 2011, J. of Machine Learning Research (JMLR), V12, P1587
- [5] Bubeck Sebastien, 2011, ALGORITHMIC LEARNING
- [7] Combes R., 2015, ARXIV E PRINTS
- [8] Coquelin Pierre-Arnaud., 2007, Uncertainty in Artificial Intelligence
- [9] Kleinberg R., 2008, S THEOR COMP
- [10] Kocsis L., 2006, EUR C MACH LEARN