50 entries in total
- [1] Optimal Algorithms for Stochastic Multi-Armed Bandits with Heavy Tailed Rewards. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [2] Trading off Rewards and Errors in Multi-Armed Bandits. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 54, 2017, 54: 709-717
- [4] Stochastic Multi-Armed Bandits with Control Variates. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021
- [5] Stochastic Multi-armed Bandits in Constant Space. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
- [6] The value of information in multi-armed bandits with exponentially distributed rewards. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS), 2011, 4: 1363-1372
- [7] Multi-armed Bandits with Generalized Temporally-Partitioned Rewards. ADVANCES IN INTELLIGENT DATA ANALYSIS XXII, PT I, IDA 2024, 2024, 14641: 41-52
- [8] Combinatorial Multi-Armed Bandits with Concave Rewards and Fairness Constraints. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020: 2554-2560
- [9] Stochastic Multi-Armed Bandits with Non-Stationary Rewards Generated by a Linear Dynamical System. 2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022: 1460-1465
- [10] Stochastic Multi-Armed Bandits with Unrestricted Delay Distributions. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139