Bayesian Incentive-Compatible Bandit Exploration

被引：16

作者：

Mansour, Yishay ^{[1
]}

Slivkins, Aleksandrs ^{[2
]}

Syrgkanis, Vasilis ^{[3
]}

机构：

[1] Tel Aviv Univ, Sch Comp Sci, IL-6997801 Tel Aviv, Israel

[2] Microsoft Res, New York, NY 10011 USA

[3] Microsoft Res, Cambridge, MA 02142 USA

来源：

OPERATIONS RESEARCH | 2020年 / 68卷 / 04期

关键词：

mechanism design; multiarmed bandits; regret; Bayesian incentive-compatibility; CLINICAL-TRIAL DESIGN; MULTIARMED BANDIT; ALGORITHMS; SIGNATURE; REGRET; BOUNDS; ERA;

D O I：

10.1287/opre.2019.1949

中图分类号：

C93 [管理学];

学科分类号：

12 ; 1201 ; 1202 ; 120202 ;

摘要：

As self-interested individuals ("agents") make decisions over time, they utilize information revealed by other agents in the past and produce information that may help agents in the future. This phenomenon is common in a wide range of scenarios in the Internet economy, as well as in medical decisions. Each agent would like to exploit: select the best action given the current information, but would prefer the previous agents to explore: try out various alternatives to collect information. A social planner, by means of a carefully designed recommendation policy, can incentivize the agents to balance the exploration and exploitation so as to maximize social welfare. We model the planner's recommendation policy as a multiarm bandit algorithm under incentive-compatibility constraints induced by agents' Bayesian priors. We design a bandit algorithm which is incentive-compatible and has asymptotically optimal performance, as expressed by regret. Further, we provide a black-box reduction from an arbitrary multiarm bandit algorithm to an incentive-compatible one, with only a constant multiplicative increase in regret. This reduction works for very general bandit settings that incorporate contexts and arbitrary partial feedback.

引用

页码：1132 / 1161

页数：30

共 50 条

[1] An Incentive-Compatible Multi-Armed Bandit Mechanism
Gonen, Rica
Pavlov, Elan
PODC'07: PROCEEDINGS OF THE 26TH ANNUAL ACM SYMPOSIUM ON PRINCIPLES OF DISTRIBUTED COMPUTING, 2007, : 362 - 363
[2] Incentive-Compatible Diffusion
Babichenko, Yakov
Dean, Oren
Tennenholtz, Moshe
WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 1379 - 1388
[3] Incentive-Compatible Assortment Optimization for Sponsored Products
Balseiro, Santiago R.
Desir, Antoine
MANAGEMENT SCIENCE, 2023, 69 (08) : 4668 - 4684
[4] An Incentive-Compatible Mechanism for Decentralized Storage Network
Vakilinia, Iman
Wang, Weihong
Xin, Jiajun
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (04): : 2294 - 2306
[5] Bayesian incentive compatible parametrization of mechanisms
Weber, Thomas A.
Bapna, Abhishek
JOURNAL OF MATHEMATICAL ECONOMICS, 2008, 44 (3-4) : 394 - 403
[6] An Incentive-Compatible and Computationally Efficient Fog Bargaining Mechanism
Kwang Mong Sim
Computational Economics, 2023, 62 : 1883 - 1918
[7] Incentive-Compatible Learning of Reserve Prices for Repeated Auctions
Kanoria, Yash
Nazerzadeh, Hamid
OPERATIONS RESEARCH, 2021, 69 (02) : 509 - 524
[8] An Incentive-Compatible Scheme for Electricity Cooperatives: An Axiomatic Approach
Ehsanfar, Abbas
Heydari, Babak
IEEE TRANSACTIONS ON SMART GRID, 2018, 9 (02) : 1416 - 1424
[9] An Incentive-Compatible and Computationally Efficient Fog Bargaining Mechanism
Sim, Kwang Mong
COMPUTATIONAL ECONOMICS, 2023, 62 (04) : 1883 - 1918
[10] Collusion-resistant, Incentive-compatible Feedback Payments
Jurca, Radu
Faltings, Boi
EC'07: PROCEEDINGS OF THE EIGHTH ANNUAL CONFERENCE ON ELECTRONIC COMMERCE, 2007, : 200 - 209

← 1 2 3 4 5 →