共 26 条
[1]
Amir I, 2020, ADV NEUR IN, V33
[2]
Auer P, 2003, SIAM J COMPUT, V32, P48, DOI 10.1137/S0097539701398375
[3]
Auer P, 1995, AN S FDN CO, P322, DOI 10.1109/SFCS.1995.492488
[4]
Auer P., 2016, Proceedings of the 29th Conference on Learning Theory, COLT 2016, P116
[5]
Auer Peter, 2019, P MACHINE LEARNING R, V99
[6]
Besbes O, 2014, ADV NEUR IN, V27
[7]
Besbes O, 2019, Stochastic Systems, V9, P319, DOI [10.1287/stsy.2019.0033, DOI 10.1287/STSY.2019.0033, 10.1287/stsy.2019.0033]
[9]
Bouneffouf Djallel., 2019, Working paper
[10]
Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
[J].
FOUNDATIONS AND TRENDS IN MACHINE LEARNING,
2012, 5 (01)
:1-122