Strategic experimentation with Poisson bandits

被引：59

作者：

Keller, Godfrey ^{[1
]}

Rady, Sven ^{[2
]}

机构：

[1] Univ Oxford, Dept Econ, Oxford OX1 2JD, England

[2] Univ Munich, Dept Econ, D-80539 Munich, Germany

来源：

THEORETICAL ECONOMICS | 2010年 / 5卷 / 02期

关键词：

Strategic experimentation; two-armed bandit; Poisson process; Bayesian learning; piecewise deterministic process; Markov perfect equilibrium; differential-difference equation;

D O I：

10.3982/TE595

中图分类号：

F [经济];

学科分类号：

02 ;

摘要：

We study a game of strategic experimentation with two-armed bandits where the risky arm distributes lump-sum payoffs according to a Poisson process. Its intensity is either high or low, and unknown to the players. We considerMarkov perfect equilibria with beliefs as the state variable and show that all such equilibria exhibit an "encouragement effect" relative to the single-agent optimum. There is no equilibrium in which all players use cutoff strategies. Owing to the encouragement effect, asymmetric equilibria in which players take turns playing the risky arm before all experimentation stops Pareto dominate the unique symmetric equilibrium. Rewarding the last experimenter with a higher continuation value increases the range of beliefs where players experiment, but may reduce the intensity of experimentation at more optimistic beliefs. This suggests that there is no equilibrium that uniformly maximizes the players' average payoff.

引用

页码：275 / 311

页数：37

共 18 条

[1]

Bellman R.E., 1963, Differential-Difference Equations, V6

[2]

Bergemann Dirk., 2008, The New Palgrave Dictionary of Economics, V2

[3]

BESANKO D, 2008, IMPACT MARKET UNPUB

[4] Strategic experimentation [J].

Bolton, P ;

Harris, C .

ECONOMETRICA, 1999, 67 (02) :349-374

[5]

BOLTON P., 2000, INCENTIVES ORG PUBLI, P53

[6]

COHEN A, 2009, BANDIT PROBLEM UNPUB

[7]

Davis M. H. A., 1993, MARKOV MODELS OPTIMI

[8] Investment timing and learning externalities [J].

Décamps, JP ;

Mariotti, T .

JOURNAL OF ECONOMIC THEORY, 2004, 118 (01) :80-102

[9]

HOPENHAYN H, 2008, PREEMPTION GAM UNPUB

[10] Strategic experimentation with exponential bandits [J].

Keller, G ;

Rady, S ;

Cripps, M .

ECONOMETRICA, 2005, 73 (01) :39-68

← 1 2 →