Bandits with Knapsacks and Predictions

被引:0
|
作者
Drago, Davide [1 ]
Celli, Andrea [1 ]
Elias, Marek [1 ]
机构
[1] Bocconi Univ, Milan, Italy
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
We study the Bandits with Knapsacks problem with the aim of designing a learning-augmented online learning algorithm upholding better regret guarantees than the state-of-the-art primal-dual algorithms with worst-case guarantees, under both stochastic and adversarial inputs. In the adversarial case, we obtain better competitive ratios when the input predictions are accurate, while also maintaining worst-case guarantees for imprecise predictions. We introduce two algorithms tailored for the full and bandit feedback settings, respectively. Both algorithms integrate a static prediction with a worst-case no-a-regret algorithm. This yields an optimized competitive ratio of (pi + (1 - pi)/alpha)(-1) in scenarios where the prediction is perfect, and a competitive ratio of alpha/(1-pi) in the case of highly imprecise predictions, where pi is an element of(0, 1) is chosen by the learner and alpha is Slater's parameter. We complement this analysis by studying the stochastic setting under full feedback. We provide an algorithm which guarantees a pseudo-regret of (O) over tilde (root T) with poor predictions, and 0 pseudo-regret with perfect predictions.
引用
收藏
页码:1189 / 1206
页数:18
相关论文
共 50 条
  • [21] A NEW TRAPDOOR IN KNAPSACKS
    NIEMI, V
    LECTURE NOTES IN COMPUTER SCIENCE, 1991, 473 : 405 - 411
  • [22] ON THE GROWTH OF RANDOM KNAPSACKS
    MAMER, JW
    SCHILLING, KE
    DISCRETE APPLIED MATHEMATICS, 1990, 28 (03) : 223 - 230
  • [23] FEASIBILITY OF INTEGER KNAPSACKS
    Aliev, Iskander
    Henk, Martin
    SIAM JOURNAL ON OPTIMIZATION, 2010, 20 (06) : 2978 - 2993
  • [24] Testing integer knapsacks for feasibility
    Hansen, P
    Ryan, J
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1996, 88 (03) : 578 - 582
  • [25] Recoverable Robust Knapsacks: Γ-Scenarios
    Buesing, Christina
    Koster, Arie M. C. A.
    Kutschka, Manuel
    NETWORK OPTIMIZATION, 2011, 6701 : 583 - 588
  • [26] RANDOM KNAPSACKS WITH MANY CONSTRAINTS
    SCHILLING, K
    DISCRETE APPLIED MATHEMATICS, 1994, 48 (02) : 163 - 174
  • [27] Non-injectivity and knapsacks
    Koskinen, Jukka A.
    Fundamenta Informaticae, 1999, 38 (1-2): : 163 - 180
  • [28] 'BANDITS, BANDITS' - GILLIAM,T
    ZIMMER, J
    REVUE DU CINEMA, 1982, (371): : 54 - 54
  • [29] 'BANDITS, BANDITS' - GILLIAN,T
    CARRERE, E
    POSITIF, 1982, (254-): : 165 - 166
  • [30] Roving bandits and stationary bandits
    Lee, S
    FORBES, 1998, 161 (09): : 149 - +