共 40 条
[1]
Agrawal S, 2016, ADV NEUR IN, V29
[2]
Agrawal Shipra, 2014, P 15 ACM C EC COMP, P989, DOI [10.1145/2600057.2602844, DOI 10.1145/2600057.2602844]
[3]
Improved rates for the stochastic continuum-armed bandit problem
[J].
LEARNING THEORY, PROCEEDINGS,
2007, 4539
:454-+
[4]
Bandits with Knapsacks (Extended Abstract)
[J].
2013 IEEE 54TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS),
2013,
:207-216
[9]
Buche R, 2001, SIAM J CONTROL OPTIM, V40, P1011, DOI 10.1137/S0363012999361639