Bandits with Knapsacks and Predictions

被引:0
|
作者
Drago, Davide [1 ]
Celli, Andrea [1 ]
Elias, Marek [1 ]
机构
[1] Bocconi Univ, Milan, Italy
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
We study the Bandits with Knapsacks problem with the aim of designing a learning-augmented online learning algorithm upholding better regret guarantees than the state-of-the-art primal-dual algorithms with worst-case guarantees, under both stochastic and adversarial inputs. In the adversarial case, we obtain better competitive ratios when the input predictions are accurate, while also maintaining worst-case guarantees for imprecise predictions. We introduce two algorithms tailored for the full and bandit feedback settings, respectively. Both algorithms integrate a static prediction with a worst-case no-a-regret algorithm. This yields an optimized competitive ratio of (pi + (1 - pi)/alpha)(-1) in scenarios where the prediction is perfect, and a competitive ratio of alpha/(1-pi) in the case of highly imprecise predictions, where pi is an element of(0, 1) is chosen by the learner and alpha is Slater's parameter. We complement this analysis by studying the stochastic setting under full feedback. We provide an algorithm which guarantees a pseudo-regret of (O) over tilde (root T) with poor predictions, and 0 pseudo-regret with perfect predictions.
引用
收藏
页码:1189 / 1206
页数:18
相关论文
共 50 条
  • [1] Bandits with Knapsacks
    Badanidiyuru, Ashwinkumar
    Kleinberg, Robert
    Slivkins, Aleksandrs
    JOURNAL OF THE ACM, 2018, 65 (03)
  • [2] Adversarial Bandits with Knapsacks
    Immorlica, Nicole
    Sankararaman, Karthik Abinav
    Schapire, Robert
    Slivkins, Aleksandrs
    2019 IEEE 60TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS 2019), 2019, : 202 - 219
  • [3] Adversarial Bandits with Knapsacks
    Immorlica, Nicole
    Sankararaman, Karthik
    Schapire, Robert
    Slivkins, Aleksandrs
    JOURNAL OF THE ACM, 2022, 69 (06)
  • [4] Approximately Stationary Bandits with Knapsacks
    Fikioris, Giannis
    Tardos, Eva
    THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195
  • [5] Bandits with Knapsacks (Extended Abstract)
    Badanidiyuru, Ashwinkumar
    Kleinberg, Robert
    Slivkins, Aleksandrs
    2013 IEEE 54TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2013, : 207 - 216
  • [6] On Logarithmic Regret for Bandits with Knapsacks
    Ren, Wenbo
    Liu, Jia
    Shroff, Ness B.
    2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
  • [7] Linear Contextual Bandits with Knapsacks
    Agrawal, Shipra
    Devanur, Nikhil R.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [8] Combinatorial Semi-Bandits with Knapsacks
    Sankararaman, Karthik Abinav
    Slivkins, Aleksandrs
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [9] Bandits with Knapsacks beyond the Worst Case
    Sankararaman, Karthik Abinav
    Slivkins, Aleksandrs
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] Non-stationary Bandits with Knapsacks
    Liu, Shang
    Jiang, Jiashuo
    Li, Xiaocheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,