Anytime Algorithms for Multi-Armed Bandit Problems

Cited by: 10
Authors
Kleinberg, Robert [1]
Affiliations
[1] MIT CSAIL, Cambridge, MA 02139 USA
Keywords
DOI
10.1145/1109557.1109659
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Subject classification code
081202;
Pages: 928 - 936
Page count: 9
Related papers
50 in total
  • [21] The Assistive Multi-Armed Bandit
    Chan, Lawrence
    Hadfield-Menell, Dylan
    Srinivasa, Siddhartha
    Dragan, Anca
    HRI '19: 2019 14TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2019, : 354 - 363
  • [22] Multi-armed bandit games
    Gursoy, Kemal
    ANNALS OF OPERATIONS RESEARCH, 2024,
  • [23] Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback
    Wang, Siwei
    Wang, Haoyun
    Huang, Longbo
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10210 - 10217
  • [24] Improving multi-armed bandit algorithms in online pricing settings
    Trovo, Francesco
    Paladino, Stefano
    Restelli, Marcello
    Gatti, Nicola
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2018, 98 : 196 - 235
  • [25] Multi-armed bandit algorithms over DASH for multihomed client
    Hodroj, Ali
    Ibrahim, Marc
    Hadjadj-Aoul, Yassine
    Sericola, Bruno
    INTERNATIONAL JOURNAL OF SENSOR NETWORKS, 2021, 37 (04) : 244 - 253
  • [26] Reconfigurable and Computationally Efficient Architecture for Multi-armed Bandit Algorithms
    Santosh, S. V. Sai
    Darak, S. J.
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [27] A Satisficing Strategy with Variable Reference in the Multi-armed Bandit Problems
    Kohno, Yu
    Takahashi, Tatsuji
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2014 (ICNAAM-2014), 2015, 1648
  • [28] GAUSSIAN PROCESS MODELLING OF DEPENDENCIES IN MULTI-ARMED BANDIT PROBLEMS
    Dorard, Louis
    Glowacka, Dorota
    Shawe-Taylor, John
    PROCEEDINGS OF THE 10TH INTERNATIONAL SYMPOSIUM ON OPERATIONAL RESEARCH SOR 09, 2009, : 77 - 84
  • [29] Time-Varying Stochastic Multi-Armed Bandit Problems
    Vakili, Sattar
    Zhao, Qing
    Zhou, Yuan
    CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014, : 2103 - 2107
  • [30] Synchronization and optimality for multi-armed bandit problems in continuous time
    ElKaroui, N
    Karatzas, I
    COMPUTATIONAL & APPLIED MATHEMATICS, 1997, 16 (02) : 117 - 151