Sample complexity of partition identification using multi-armed bandits

被引:0
|
作者
Juneja, Sandeep [1 ]
Krishnasamy, Subhashini [1 ]
机构
[1] TIFR, Mumbai, Maharashtra, India
来源
CONFERENCE ON LEARNING THEORY, VOL 99 | 2019年 / 99卷
关键词
multi-armed bandits; best arm identification; pure exploration; partition identification; ALLOCATION; SIMULATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a vector of probability distributions, or arms, each of which can be sampled independently, we consider the problem of identifying the partition to which this vector belongs from a finitely partitioned universe of such vector of distributions. We study this as a pure exploration problem in multi-armed bandit settings and develop sample complexity bounds on the total mean number of samples required for identifying the correct partition with high probability. This framework subsumes well studied problems such as finding the best arm or the best few arms. We consider distributions belonging to the single parameter exponential family and primarily consider partitions where the vector of means of arms lie either in a given set or its complement. The sets considered correspond to distributions where there exists a mean above a specified threshold, where the set is a half space and where either the set or its complement is a polytope, or more generally, a convex set. In these settings, we characterize the lower bounds on mean number of samples for each arm highlighting their dependence on the problem geometry. Further, inspired by the lower bounds, we propose algorithms that can match these bounds asymptotically with decreasing probability of error. Applications of this framework may be diverse. We briefly discuss a few associated with simulation in finance.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] Secure Best Arm Identification in Multi-armed Bandits
    Ciucanu, Radu
    Lafourcade, Pascal
    Lombard-Platet, Marius
    Soare, Marta
    INFORMATION SECURITY PRACTICE AND EXPERIENCE, ISPEC 2019, 2019, 11879 : 152 - 171
  • [2] Case-Based Sample Generation Using Multi-Armed Bandits
    Korger, Andreas
    Baumeister, Joachim
    CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2023, 2023, 14141 : 118 - 133
  • [3] On Sequential Elimination Algorithms for Best-Arm Identification in Multi-Armed Bandits
    Shahrampour, Shahin
    Noshad, Mohammad
    Tarokh, Vahid
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (16) : 4281 - 4292
  • [4] On the Bias, Risk, and Consistency of Sample Means in Multi-armed Bandits
    Shin, Jaehyeok
    Ramdas, Aaditya
    Rinaldo, Alessandro
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2021, 3 (04): : 1278 - 1300
  • [5] Best-Arm Identification in Correlated Multi-Armed Bandits
    Gupta S.
    Joshi G.
    Yagan O.
    IEEE Journal on Selected Areas in Information Theory, 2021, 2 (02): : 549 - 563
  • [6] Pruning neural networks using multi-armed bandits
    Ameen S.
    Vadera S.
    Computer Journal, 2020, 63 (07): : 1099 - 1108
  • [7] Best Arm Identification in Restless Markov Multi-Armed Bandits
    Karthik, P. N.
    Reddy, Kota Srinivas
    Tan, Vincent Y. F.
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (05) : 3240 - 3262
  • [8] Multi-armed bandits with dependent arms
    Singh, Rahul
    Liu, Fang
    Sun, Yin
    Shroff, Ness
    MACHINE LEARNING, 2024, 113 (01) : 45 - 71
  • [9] Multi-armed bandits with episode context
    Christopher D. Rosin
    Annals of Mathematics and Artificial Intelligence, 2011, 61 : 203 - 230
  • [10] Multi-Armed Bandits With Costly Probes
    Elumar, Eray Can
    Tekin, Cem
    Yagan, Osman
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2025, 71 (01) : 618 - 643