Sample complexity of partition identification using multi-armed bandits

被引：0

作者：

Juneja, Sandeep ^{[1
]}

Krishnasamy, Subhashini ^{[1
]}

机构：

[1] TIFR, Mumbai, Maharashtra, India

来源：

CONFERENCE ON LEARNING THEORY, VOL 99 | 2019年 / 99卷

关键词：

multi-armed bandits; best arm identification; pure exploration; partition identification; ALLOCATION; SIMULATION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Given a vector of probability distributions, or arms, each of which can be sampled independently, we consider the problem of identifying the partition to which this vector belongs from a finitely partitioned universe of such vector of distributions. We study this as a pure exploration problem in multi-armed bandit settings and develop sample complexity bounds on the total mean number of samples required for identifying the correct partition with high probability. This framework subsumes well studied problems such as finding the best arm or the best few arms. We consider distributions belonging to the single parameter exponential family and primarily consider partitions where the vector of means of arms lie either in a given set or its complement. The sets considered correspond to distributions where there exists a mean above a specified threshold, where the set is a half space and where either the set or its complement is a polytope, or more generally, a convex set. In these settings, we characterize the lower bounds on mean number of samples for each arm highlighting their dependence on the problem geometry. Further, inspired by the lower bounds, we propose algorithms that can match these bounds asymptotically with decreasing probability of error. Applications of this framework may be diverse. We briefly discuss a few associated with simulation in finance.

引用

页数：29

共 50 条

[31] An empirical evaluation of active inference in multi-armed bandits
Markovic, Dimitrije
Stojic, Hrvoje
Schwoebel, Sarah
Kiebel, Stefan J.
NEURAL NETWORKS, 2021, 144 : 229 - 246
[32] On Federated Multi-Armed Bandits for Mobile Social Networks
Sakai, Kazuya
Kitamura, Takeshi
Sun, Min-Te
Ku, Wei-Shinn
2024 IEEE 44TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS 2024, 2024, : 774 - 784
[33] Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling
Lin, Baihan
2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,
[34] Multi-Armed Bandits With Self-Information Rewards
Weinberger, Nir
Yemini, Michal
IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (11) : 7160 - 7184
[35] Bounded Regret for Finitely Parameterized Multi-Armed Bandits
Panaganti, Kishan
Kalathil, Dileep
IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (03): : 1073 - 1078
[36] Budgeted Multi-Armed Bandits with Asymmetric Confidence Intervals
Heyden, Marco
Arzamasov, Vadim
Fouche, Edouard
Boehm, Klemens
PROCEEDINGS OF THE 30TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2024, 2024, : 1073 - 1084
[37] Dynamic Ambulance Redeployment via Multi-armed Bandits
Sahin, Umitcan
Yucesoy, Veysel
2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
[38] CORRELATED MULTI-ARMED BANDITS WITH A LATENT RANDOM SOURCE
Gupta, Samarth
Joshi, Gauri
Yagan, Osman
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3572 - 3576
[39] Adaptive Data Depth via Multi-Armed Bandits
Baharav, Tavor Z.
Lai, Tze Leung
JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
[40] Risk Control of Best Arm Identification in Multi-Armed Bandits via Successive Rejects
Yu, Xiaotian
King, Irwin
Lyu, Michael R.
2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 1147 - 1152

← 1 2 3 4 5 →