Sample complexity of partition identification using multi-armed bandits

被引:0
|
作者
Juneja, Sandeep [1 ]
Krishnasamy, Subhashini [1 ]
机构
[1] TIFR, Mumbai, Maharashtra, India
来源
CONFERENCE ON LEARNING THEORY, VOL 99 | 2019年 / 99卷
关键词
multi-armed bandits; best arm identification; pure exploration; partition identification; ALLOCATION; SIMULATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a vector of probability distributions, or arms, each of which can be sampled independently, we consider the problem of identifying the partition to which this vector belongs from a finitely partitioned universe of such vector of distributions. We study this as a pure exploration problem in multi-armed bandit settings and develop sample complexity bounds on the total mean number of samples required for identifying the correct partition with high probability. This framework subsumes well studied problems such as finding the best arm or the best few arms. We consider distributions belonging to the single parameter exponential family and primarily consider partitions where the vector of means of arms lie either in a given set or its complement. The sets considered correspond to distributions where there exists a mean above a specified threshold, where the set is a half space and where either the set or its complement is a polytope, or more generally, a convex set. In these settings, we characterize the lower bounds on mean number of samples for each arm highlighting their dependence on the problem geometry. Further, inspired by the lower bounds, we propose algorithms that can match these bounds asymptotically with decreasing probability of error. Applications of this framework may be diverse. We briefly discuss a few associated with simulation in finance.
引用
收藏
页数:29
相关论文
共 50 条
  • [31] An empirical evaluation of active inference in multi-armed bandits
    Markovic, Dimitrije
    Stojic, Hrvoje
    Schwoebel, Sarah
    Kiebel, Stefan J.
    NEURAL NETWORKS, 2021, 144 : 229 - 246
  • [32] On Federated Multi-Armed Bandits for Mobile Social Networks
    Sakai, Kazuya
    Kitamura, Takeshi
    Sun, Min-Te
    Ku, Wei-Shinn
    2024 IEEE 44TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS 2024, 2024, : 774 - 784
  • [33] Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling
    Lin, Baihan
    2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,
  • [34] Multi-Armed Bandits With Self-Information Rewards
    Weinberger, Nir
    Yemini, Michal
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (11) : 7160 - 7184
  • [35] Bounded Regret for Finitely Parameterized Multi-Armed Bandits
    Panaganti, Kishan
    Kalathil, Dileep
    IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (03): : 1073 - 1078
  • [36] Budgeted Multi-Armed Bandits with Asymmetric Confidence Intervals
    Heyden, Marco
    Arzamasov, Vadim
    Fouche, Edouard
    Boehm, Klemens
    PROCEEDINGS OF THE 30TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2024, 2024, : 1073 - 1084
  • [37] Dynamic Ambulance Redeployment via Multi-armed Bandits
    Sahin, Umitcan
    Yucesoy, Veysel
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [38] CORRELATED MULTI-ARMED BANDITS WITH A LATENT RANDOM SOURCE
    Gupta, Samarth
    Joshi, Gauri
    Yagan, Osman
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3572 - 3576
  • [39] Adaptive Data Depth via Multi-Armed Bandits
    Baharav, Tavor Z.
    Lai, Tze Leung
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [40] Risk Control of Best Arm Identification in Multi-Armed Bandits via Successive Rejects
    Yu, Xiaotian
    King, Irwin
    Lyu, Michael R.
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 1147 - 1152