Oracle-Efficient Online Learning and Auction Design

被引：25

作者：

Dudik, Miroslav ^{[1
]}

Haghtalab, Nika ^{[2
]}

Luo, Haipeng ^{[3
]}

Schapire, Robert E. ^{[1
]}

Syrgkanis, Vasilis ^{[4
]}

Vaughan, Jennifer Wortman ^{[1
]}

机构：

[1] Microsoft Res, New York, NY 10036 USA

[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[3] Univ Southern Calif, Los Angeles, CA USA

[4] Microsoft Res, Cambridge, MA USA

来源：

2017 IEEE 58TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS) | 2017年

关键词：

online learning; auction design; revenue maximization; Follow-the-Perturbed-Leader; ALGORITHMS;

D O I：

10.1109/FOCS.2017.55

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

We consider the design of computationally efficient online learning algorithms in an adversarial setting in which the learner has access to an offline optimization oracle. We present an algorithm called Generalized Follow-the-Perturbed-Leader and provide conditions under which it is oracle-efficient while achieving vanishing regret. Our results make significant progress on an open problem raised by Hazan and Koren [1], who showed that oracle-efficient algorithms do not exist in full generality and asked whether one can identify conditions under which oracle-efficient online learning may be possible. Our auction-design framework considers an auctioneer learning an optimal auction for a sequence of adversarially selected valuations with the goal of achieving revenue that is almost as good as the optimal auction in hindsight, among a class of auctions. We give oracle-efficient learning results for: (1) VCG auctions with bidder-specific reserves in single-Parameter settings, (2) envy-free item-pricing auctions in multiitem settings, and (3) the level auctions of Morgenstern and Roughgarden [2] for single-item settings. The last result leads to an approximation of the overall optimal Myerson auction when bidders' valuations are drawn according to a fast-mixing Markov process, extending prior work that only gave such guarantees for the i.i.d. setting. We also derive various extensions, including: (1) oracle-efficient algorithms for the contextual learning setting in which the learner has access to side information (such as bidder demographics), (2) learning with approximate oracles such as those based on Maximal-in-Range algorithms, and (3) no-regret bidding algorithms in simultaneous auctions, which resolve an open problem of Daskalakis and Syrgkanis [3].

引用

页码：528 / 539

页数：12

共 30 条

[1] Agarwal A., 2014, ICML
[2] [Anonymous], EC
[3] Online linear optimization and adaptive routing
Awerbuch, Baruch
Kleinberg, Robert
[J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2008, 74 (01) : 97 - 114
[4] Balcan M.-F., 2006, EC
[5] Blum A., 2005, SODA
[6] On the generalization ability of on-line learning algorithms
Cesa-Bianchi, N
Conconi, A
Gentile, C
[J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2004, 50 (09) : 2050 - 2057
[7] Cesa-Bianchi Nicolo, 2013, SODA
[8] Cole R., 2014, STOC
[9] Daskalakis C., 2016, FOCS
[10] Devanur N. R., 2016, STOC

← 1 2 3 →