Thompson sampling for multi-armed bandits in big data environments

被引：0

作者：

Kim, Min Kyong ^{[1
]}

Hwang, Beom Seuk ^{[1
]}

机构：

[1] Chung Ang Univ, Dept Appl Stat, 84 Heukseok Ro, Seoul 06974, South Korea

来源：

KOREAN JOURNAL OF APPLIED STATISTICS | 2024年 / 37卷 / 05期

关键词：

approximation; Bayesian optimization; multi-armed bandits; statistical learning; Thompson sampling;

D O I：

10.5351/KJAS.2024.37.5.663

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

The multi-armed bandits (MAB) problem, involves selecting actions to maximize rewards within dynamic environments. This study explores the application of Thompson sampling, a robust MAB algorithm, within the context of big data analytics and statistical learning theory. By leveraging large-scale banner click data from recommendation systems, we evaluate Thompson sampling's performance across various simulated scenarios, employing advanced approximation techniques. Our findings demonstrate that Thompson sampling, particularly with Langevin Monte Carlo approximation, maintains robust performance and scalability in big data environments. This underscores its practical significance and adaptability, aligning with contemporary challenges in statistical learning

引用

页数：12

共 50 条

[1] Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling
Lin, Baihan
2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,
[2] Asymptotic Performance of Thompson Sampling for Batched Multi-Armed Bandits
Kalkanli, Cem
Ozgur, Ayfer
IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (09) : 5956 - 5970
[3] Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits
Park, Hongju
Faradonbeh, Mohamad Kazem Shirani
IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2150 - 2155
[4] Adaptive Data Depth via Multi-Armed Bandits
Baharav, Tavor Z.
Lai, Tze Leung
JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
[5] Visualizations for interrogations of multi-armed bandits
Keaton, Timothy J.
Sabbaghi, Arman
STAT, 2019, 8 (01):
[6] Multi-armed bandits with dependent arms
Singh, Rahul
Liu, Fang
Sun, Yin
Shroff, Ness
MACHINE LEARNING, 2024, 113 (01) : 45 - 71
[7] Multi-armed bandits with episode context
Christopher D. Rosin
Annals of Mathematics and Artificial Intelligence, 2011, 61 : 203 - 230
[8] Multi-Armed Bandits With Costly Probes
Elumar, Eray Can
Tekin, Cem
Yagan, Osman
IEEE TRANSACTIONS ON INFORMATION THEORY, 2025, 71 (01) : 618 - 643
[9] Multi-Armed Bandits With Correlated Arms
Gupta, Samarth
Chaudhari, Shreyas
Joshi, Gauri
Yagan, Osman
IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (10) : 6711 - 6732
[10] Multi-armed bandits with episode context
Rosin, Christopher D.
ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2011, 61 (03) : 203 - 230

← 1 2 3 4 5 →