47 entries in total
[2]
Agrawal S., 2013, INT C MACHINE LEARNI, V28, P127, DOI 10.5555/3042817.3043073
[3]
A neural networks committee for the contextual bandit problem [J]. Allesiardo, Robin, 1600, Springer Verlag (8834): 374-381
[4]
Anderson T, 2008, THEORY AND PRACTICE OF ONLINE LEARNING, 2ND EDITION, P45
[5]
[Anonymous], 2017, ARXIV171102487
[6]
[Anonymous], 2021, SIGKDD, DOI 10.1145/3447548.3467299
[7]
[Anonymous], 2011, UNBIASED OFFLINE EVA
[8]
[Anonymous], 2014, RECSYS, DOI 10.1145/2645710.2645733
[9]
[Anonymous], 2017, SIGKDD, DOI 10.1145/3097983.3098041
[10]
Auer P, 2003, SIAM J COMPUT, V32, P48, DOI 10.1137/S0097539701398375