Competing Bandits: The Perils of Exploration Under Competition

被引:0
作者
Aridor, Guy [1 ]
Mansour, Yishay [2 ]
Slivkins, Aleksandrs [3 ]
Wu, Steven [4 ]
机构
[1] Northwestern Univ, Mkt, Evanston, IL 60208 USA
[2] Tel Aviv Univ, Comp Sci, Tel Aviv, Israel
[3] Microsoft Res New York, New York, NY USA
[4] Carnegie Mellon Univ, Pittsburgh, PA USA
关键词
Multi-armed bandits; exploration; competition vs. innovation; MULTIARMED BANDIT; PRODUCT DIFFERENTIATION; EXPERIMENTATION; INNOVATION; TRADE;
D O I
10.1145/3711831
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Most online platforms learn from interactions with users and engage in exploration: making potentially suboptimal choices to acquire new information. We study the interplay between exploration and competition: how such platforms balance the exploration for learning and competition for users. We consider a stylized duopoly in which two firms face the same multi-armed bandit problem. Users arrive one by one and choose between the two firms, so that each firm makes progress on its bandit problem only if it is chosen. We study whether competition incentivizes the adoption of better algorithms. We find that stark competition disincentivizes exploration, leading to low welfare. However, weaker competition incentivizes better exploration algorithms and increases welfare. We investigate two channels for weakening the competition: stochastic user choice models and a first-mover advantage. Our findings speak to the competition-innovation relationship and the first-mover advantage in the digital economy.
引用
收藏
页数:47
相关论文
共 71 条
[1]  
Agarwal A, 2017, Arxiv, DOI arXiv:1606.03966
[2]   Competition and innovation: An inverted-U relationship [J].
Aghion, P ;
Bloom, N ;
Blundell, R ;
Griffith, R ;
Howitt, P .
QUARTERLY JOURNAL OF ECONOMICS, 2005, 120 (02) :701-728
[3]  
Agrawal Shipra, 2013, AISTATS, P99
[4]  
Agrawal Shipra, 2012, P 25 C LEARN THEOR C
[5]   THE ROLE OF INFORMATION IN INNOVATION AND COMPETITION [J].
Akcigit, Ufuk ;
Liu, Qingmin .
JOURNAL OF THE EUROPEAN ECONOMIC ASSOCIATION, 2016, 14 (04) :828-870
[6]   The Perils of Exploration under Competition: A Computational Modeling Approach [J].
Aridor, Guy ;
Liu, Kevin ;
Slivkins, Aleksandrs ;
Wu, Zhiwei Steven .
ACM EC '19: PROCEEDINGS OF THE 2019 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2019, :171-172
[7]  
Auer P, 2003, SIAM J COMPUT, V32, P48, DOI 10.1137/S0097539701398375
[8]   Finite-time analysis of the multiarmed bandit problem [J].
Auer, P ;
Cesa-Bianchi, N ;
Fischer, P .
MACHINE LEARNING, 2002, 47 (2-3) :235-256
[9]   CHARACTERIZING TRUTHFUL MULTI-ARMED BANDIT MECHANISMS [J].
Babaioff, Moshe ;
Sharma, Yogeshwer ;
Slivkins, Aleksandrs .
SIAM JOURNAL ON COMPUTING, 2014, 43 (01) :194-230
[10]   INFORMATIONAL PRODUCT DIFFERENTIATION AS A BARRIER TO ENTRY [J].
BAGWELL, K .
INTERNATIONAL JOURNAL OF INDUSTRIAL ORGANIZATION, 1990, 8 (02) :207-223