共 50 条
- [1] Collaborative Linear Bandits with Adversarial Agents: Near-Optimal Regret Bounds ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [2] Unimodal Bandits: Regret Lower Bounds and Optimal Algorithms INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 1), 2014, 32
- [3] Regret Bounds for Batched Bandits THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7340 - 7348
- [7] Near-optimal Per-Action Regret Bounds for Sleeping Bandits INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [8] On Regret-Optimal Learning in Decentralized Multiplayer Multiarmed Bandits IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2018, 5 (01): : 597 - 606
- [9] Near-Optimal Collaborative Learning in Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [10] Complete Policy Regret Bounds for Tallying Bandits CONFERENCE ON LEARNING THEORY, VOL 178, 2022, 178