28 entries in total
- [1] Amin K., 2013, ADV NEURAL INFORM PR, P1169
- [2] [Anonymous], 2011, P 24 ANN C LEARN THE
- [3] [Anonymous], 2011, NIPS
- [5] Finite-time analysis of the multiarmed bandit problem [J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256
- [6] Bandits with Knapsacks (Extended Abstract) [J]. 2013 IEEE 54TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2013 : 207 - 216
- [7] Online learning in online auctions [J]. THEORETICAL COMPUTER SCIENCE, 2004, 324 (2-3) : 137 - 146
- [10] Combes R, 2014, PR MACH LEARN RES, V32