Bandits with Knapsacks and Predictions
Cited by: 0
Authors: Drago, Davide [1]; Celli, Andrea [1]; Elias, Marek [1]
Affiliations: [1] Bocconi Univ, Milan, Italy
Source:
Keywords:
DOI: not available
Abstract:
We study the Bandits with Knapsacks problem with the aim of designing a learning-augmented online learning algorithm that upholds better regret guarantees than the state-of-the-art primal-dual algorithms with worst-case guarantees, under both stochastic and adversarial inputs. In the adversarial case, we obtain better competitive ratios when the input predictions are accurate, while also maintaining worst-case guarantees for imprecise predictions. We introduce two algorithms tailored for the full and bandit feedback settings, respectively. Both algorithms integrate a static prediction with a worst-case no-α-regret algorithm. This yields an optimized competitive ratio of (π + (1 − π)/α)^{-1} in scenarios where the prediction is perfect, and a competitive ratio of α/(1 − π) in the case of highly imprecise predictions, where π ∈ (0, 1) is chosen by the learner and α is Slater's parameter. We complement this analysis by studying the stochastic setting under full feedback. We provide an algorithm which guarantees a pseudo-regret of Õ(√T) with poor predictions, and zero pseudo-regret with perfect predictions.
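As a numerical illustration of the trade-off stated in the abstract, the following sketch evaluates the two competitive-ratio formulas for a sample choice of π and α. The function names are hypothetical (not from the paper); only the formulas themselves come from the abstract.

```python
# Illustration of the competitive ratios from the abstract.
# pi in (0, 1) is chosen by the learner; alpha is Slater's parameter.
# Function names below are illustrative, not from the paper.

def ratio_perfect_prediction(pi: float, alpha: float) -> float:
    """Competitive ratio (pi + (1 - pi)/alpha)^(-1) when the prediction is perfect."""
    return 1.0 / (pi + (1.0 - pi) / alpha)

def ratio_imprecise_prediction(pi: float, alpha: float) -> float:
    """Competitive ratio alpha/(1 - pi) under highly imprecise predictions."""
    return alpha / (1.0 - pi)

# Example: pi = 0.5, alpha = 2
print(ratio_perfect_prediction(0.5, 2.0))    # 1.333... (better than alpha alone)
print(ratio_imprecise_prediction(0.5, 2.0))  # 4.0 (worst-case fallback)
```

A larger π places more trust in the prediction: the perfect-prediction ratio improves toward 1, while the imprecise-prediction fallback α/(1 − π) degrades.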
Pages: 1189-1206
Page count: 18