共 30 条
[1]
Abbasi-Yadkori Y., 2011, COLT, P1
[5]
Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
[J].
FOUNDATIONS AND TRENDS IN MACHINE LEARNING,
2012, 5 (01)
:1-122
[6]
Coppens P, 2020, PR MACH LEARN RES, V120, P521
[7]
Dean S, 2018, ADV NEUR IN, V31
[8]
A System Level Approach to Regret Optimal Control
[J].
IEEE CONTROL SYSTEMS LETTERS,
2022, 6
:2792-2797