共 23 条
- [1] THE CONTINUUM-ARMED BANDIT PROBLEM [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1995, 33 (06) : 1926 - 1951
- [2] Finite-time analysis of the multiarmed bandit problem [J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256
- [3] Improved rates for the stochastic continuum-armed bandit problem [J]. LEARNING THEORY, PROCEEDINGS, 2007, 4539 : 454 - +
- [4] Bubeck S., 2010, THESIS U LILLE 1, P1
- [5] Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 5 (01): : 1 - 122
- [6] Bubeck S, 2011, LECT NOTES ARTIF INT, V6925, P144, DOI 10.1007/978-3-642-24412-4_14
- [7] Bubeck S, 2011, J MACH LEARN RES, V12, P1655
- [8] Bubeck S, 2009, LECT NOTES ARTIF INT, V5809, P23, DOI 10.1007/978-3-642-04414-4_7
- [9] Bull A.D., 2014, ADAPTIVE TREED BAN S, DOI [10.3150/14-BEJ644SUPP, DOI 10.3150/14-BEJ644SUPP]