共 8 条
- [1] Tsallis-INF for Decoupled Exploration and Exploitation in Multi-armed Bandits CONFERENCE ON LEARNING THEORY, VOL 125, 2020, 125
- [6] The Exploration-Exploitation Trade-off in Interactive Recommender Systems PROCEEDINGS OF THE ELEVENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'17), 2017, : 431 - 435
- [7] Exploration with Limited Memory: Streaming Algorithms for Coin Tossing, Noisy Comparisons, and Multi-armed Bandits PROCEEDINGS OF THE 52ND ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '20), 2020, : 1237 - 1250