Sample complexity of partition identification using multi-armed bandits

被引：0

作者：

Juneja, Sandeep ^{[1
]}

Krishnasamy, Subhashini ^{[1
]}

机构：

[1] TIFR, Mumbai, Maharashtra, India

来源：

CONFERENCE ON LEARNING THEORY, VOL 99 | 2019年 / 99卷

关键词：

multi-armed bandits; best arm identification; pure exploration; partition identification; ALLOCATION; SIMULATION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Given a vector of probability distributions, or arms, each of which can be sampled independently, we consider the problem of identifying the partition to which this vector belongs from a finitely partitioned universe of such vector of distributions. We study this as a pure exploration problem in multi-armed bandit settings and develop sample complexity bounds on the total mean number of samples required for identifying the correct partition with high probability. This framework subsumes well studied problems such as finding the best arm or the best few arms. We consider distributions belonging to the single parameter exponential family and primarily consider partitions where the vector of means of arms lie either in a given set or its complement. The sets considered correspond to distributions where there exists a mean above a specified threshold, where the set is a half space and where either the set or its complement is a polytope, or more generally, a convex set. In these settings, we characterize the lower bounds on mean number of samples for each arm highlighting their dependence on the problem geometry. Further, inspired by the lower bounds, we propose algorithms that can match these bounds asymptotically with decreasing probability of error. Applications of this framework may be diverse. We briefly discuss a few associated with simulation in finance.

引用

页数：29

共 50 条

[1] Secure Best Arm Identification in Multi-armed Bandits
Ciucanu, Radu
Lafourcade, Pascal
Lombard-Platet, Marius
Soare, Marta
INFORMATION SECURITY PRACTICE AND EXPERIENCE, ISPEC 2019, 2019, 11879 : 152 - 171
[2] Case-Based Sample Generation Using Multi-Armed Bandits
Korger, Andreas
Baumeister, Joachim
CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2023, 2023, 14141 : 118 - 133
[3] On Sequential Elimination Algorithms for Best-Arm Identification in Multi-Armed Bandits
Shahrampour, Shahin
Noshad, Mohammad
Tarokh, Vahid
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (16) : 4281 - 4292
[4] On the Bias, Risk, and Consistency of Sample Means in Multi-armed Bandits
Shin, Jaehyeok
Ramdas, Aaditya
Rinaldo, Alessandro
SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2021, 3 (04): : 1278 - 1300
[5] Best-Arm Identification in Correlated Multi-Armed Bandits
Gupta S.
Joshi G.
Yagan O.
IEEE Journal on Selected Areas in Information Theory, 2021, 2 (02): : 549 - 563
[6] Pruning neural networks using multi-armed bandits
Ameen S.
Vadera S.
Computer Journal, 2020, 63 (07): : 1099 - 1108
[7] Best Arm Identification in Restless Markov Multi-Armed Bandits
Karthik, P. N.
Reddy, Kota Srinivas
Tan, Vincent Y. F.
IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (05) : 3240 - 3262
[8] Multi-armed bandits with dependent arms
Singh, Rahul
Liu, Fang
Sun, Yin
Shroff, Ness
MACHINE LEARNING, 2024, 113 (01) : 45 - 71
[9] Multi-armed bandits with episode context
Christopher D. Rosin
Annals of Mathematics and Artificial Intelligence, 2011, 61 : 203 - 230
[10] Multi-Armed Bandits With Costly Probes
Elumar, Eray Can
Tekin, Cem
Yagan, Osman
IEEE TRANSACTIONS ON INFORMATION THEORY, 2025, 71 (01) : 618 - 643

← 1 2 3 4 5 →