共 25 条
[1]
Azoulay-Schwartz R(2004)Exploitation vs. exploration: choosing a supplier in an environment of incomplete information Decis Support Syst 38 1-18
[2]
Kraus S(1994)Switching costs and the Gittins index Econometrica 62 687-694
[3]
Wilkenfeld J(2002)Optimal learning and experimentation in bandit problems J Econ Dyn Control 27 87-107
[4]
Banks JS(2005)Information technology project failures: applying the bandit problem to evaluate managerial decision making Inf Manag Comp Secur 13 135-143
[5]
Sundaram RK(2000)Do evolutionary processes minimize expected losses? J Theor Biol 207 117-123
[6]
Brezzi M(2004)A survey on the bandit problem with switching costs Economist 152 513-541
[7]
Lai TL(1996)Reinforcement learning: a survey J Artif Intell Res 4 237-285
[8]
Chulkov DV(1998)Multi-armed bandits in discrete and continuous time Ann Appl Probab 8 1270-1290
[9]
Desai MS(2001)Dynamic pricing on the internet: theory and simulations Electron Commer Res 1 265-276
[10]
Fogel DB(1981)Systematic search, related information and the Gittins’ index Econ Lett 8 327-333