20 entries in total
- [3] [Anonymous], 2012, REGRET ANAL STOCHAST
- [4] Finite-time analysis of the multiarmed bandit problem [J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256
- [5] ON SOME ROBUST ESTIMATES OF LOCATION [J]. ANNALS OF MATHEMATICAL STATISTICS, 1965, 36 (03) : 847 - 858
- [6] Bubeck S., 2010, THESIS U LILLE 1 LIL
- [7] Catoni O., 2010, CHALLENGING EMPIRICA
- [8] An optimal algorithm for Monte Carlo estimation [J]. SIAM JOURNAL ON COMPUTING, 2000, 29 (05) : 1484 - 1496