Dynamic Allocation Optimization in A/B-Tests Using Classification-Based Preprocessing

被引：4

作者：

Claeys, Emmanuelle ^{[1
]}

Gancarski, Pierre ^{[2
,3
]}

Maumy-Bertrand, Myriam ^{[2
,3
]}

Wassner, Hubert ^{[4
]}

机构：

[1] Univ Toulouse, IRIT Lab, F-31000 Toulouse, France

[2] Univ Strasbourg, ICUBE, F-67081 Strasbourg, France

[3] Univ Strasbourg, IRMA Lab, F-67081 Strasbourg, France

[4] AB Tasty, F-75003 Paris, France

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2023年 / 35卷 / 01期

关键词：

A/B-TEST; bandit strategies; UCB strategies; conditional inference tree; non linear bandit; regret minimisation;

D O I：

10.1109/TKDE.2021.3076025

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

An A/B-Test evaluates the impact of a new technology by running it in a real production environment and testing its performance on a set of items. Recent development efforts around A/B-Tests revolve around dynamic allocation. They allow for quicker determination of the best variation (A or B), thus saving money for the user. However, dynamic allocation by traditional methods requires certain assumptions, which are not always valid in reality. This is often due to the fact that the populations being tested are not homogeneous. This article reports on a new reinforcement learning methodology which has been deployed by the commercial A/B-Test platform AB Tasty. We provide a new method that not only builds homogeneous groups of users, but also allows the best variation for these groups to be found in a short period of time. This article provides numerical results on AB Tasty's data, in addition to public datasets, tha demonstrate an improvement over traditional methods.

引用

页码：335 / 349

页数：15

共 43 条

[1] Finite-time analysis of the multiarmed bandit problem [J].

Auer, P ;

Cesa-Bianchi, N ;

Fischer, P .

MACHINE LEARNING, 2002, 47 (2-3) :235-256

[2]

Bastani H., 2017, Mostly exploration-free algorithms for contextual bandits

[3]

Breiman L., 1984, Classification and regression trees, DOI DOI 10.1201/9781315139470

[4] Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems [J].

Bubeck, Sebastien ;

Cesa-Bianchi, Nicolo .

FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 5 (01) :1-122

[5]

Burtini G, 2015, Arxiv, DOI arXiv:1510.00757

[6]

Carrara N., 2019, ADVANCESNEURAL INF P, P9299

[7]

Cesa-Bianchi N., 2014, On the complexity of learning with kernels

[8]

Chu W., 2011, P 14 INT C ART INT S, P208, DOI DOI 10.48550/ARXIV.1209.3352

[9] MULTIPLE COMPARISONS AMONG MEANS [J].

DUNN, OJ .

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1961, 56 (293) :52-&

[10]

Elmachtoub A. N., 2017, ARXIV170604687

← 1 2 3 4 5 →