共 50 条
- [1] The value of information in multi-armed bandits with exponentially distributed rewards PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS), 2011, 4 : 1363 - 1372
- [2] Trading off Rewards and Errors in Multi-Armed Bandits ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 54, 2017, 54 : 709 - 717
- [3] Parametrized Stochastic Multi-armed Bandits with Binary Rewards 2011 AMERICAN CONTROL CONFERENCE, 2011, : 119 - 124
- [4] Multi-armed Bandits with Generalized Temporally-Partitioned Rewards ADVANCES IN INTELLIGENT DATA ANALYSIS XXII, PT I, IDA 2024, 2024, 14641 : 41 - 52
- [5] Combinatorial Multi-Armed Bandits with Concave Rewards and Fairness Constraints PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2554 - 2560
- [6] Multi-player Multi-armed Bandits: Decentralized Learning with IID Rewards 2012 50TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2012, : 853 - 860
- [7] Optimal Algorithms for Stochastic Multi-Armed Bandits with Heavy Tailed Rewards ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [8] Maximizing and Satisficing in Multi-armed Bandits with Graph Information ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [9] On Kernelized Multi-armed Bandits INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [10] Multi-armed Bandits with Compensation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31