35 entries in total
[2]
[Anonymous], 2012, DYNAMIC PROGRAMMING
[3]
[Anonymous], 2011, Multi-armed bandit allocation indices
[4]
Bacinoglu BT, 2015, 2015 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), P25, DOI 10.1109/ITA.2015.7308962
[5]
Bertsekas D., 2012, Dynamic Programming and Optimal Control, V4
[6]
Borkar V. S., 2009, Stochastic Approximation: A Dynamical Systems Viewpoint, V48
[9]
Cho J, 2000, SIGMOD REC, V29, P117, DOI 10.1145/335191.335391