共 50 条
- [2] Generic Asymptotically Optimal Algorithms for Multi-Armed Bandits 2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 152 - 152
- [3] Reinforcement Learning Augmented Asymptotically Optimal Index Policy for Finite-Horizon Restless Bandits THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8726 - 8734
- [5] On Optimal Foraging and Multi-armed Bandits 2013 51ST ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2013, : 494 - 499
- [6] On a Class of Restless Multi-armed Bandits with Deterministic Policies 2018 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM 2018), 2018, : 487 - 491
- [7] Optimal Algorithms for Multiplayer Multi-Armed Bandits INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
- [8] Optimal Streaming Algorithms for Multi-Armed Bandits INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [9] Multi-armed Bandits: Competing with Optimal Sequences ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29