30 entries in total
- [1] [Anonymous], 2001, Tech. Rep. CMU-RI-TR-01-25
- [2] Auer P, 2003, SIAM J COMPUT, V32, P48, DOI 10.1137/S0097539701398375
- [3] Bertsekas D., 2012, Dynamic Programming and Optimal Control, V4
- [4] Besbes O., 2014, OPTIMAL EXPLORATION
- [10] The multi-armed bandit, with constraints. Annals of Operations Research, 2013, 208(1): 37-62