共 62 条
[1]
Abbasi-Yadkori Y., ADV NEURAL INFORM PR, P2312
[6]
Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
[J].
FOUNDATIONS AND TRENDS IN MACHINE LEARNING,
2012, 5 (01)
:1-122
[7]
Chu W., 2011, P 14 INT C ARTIFICIA, P208, DOI DOI 10.48550/ARXIV.1209.3352