50 entries in total
- [1] Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022: 19027 - 19049
- [2] Achieving Optimal Dynamic Regret for Non-stationary Bandits without Prior Information. CONFERENCE ON LEARNING THEORY, VOL 99, 2019
- [3] ANACONDA: An Improved Dynamic Regret Algorithm for Adaptive Non-Stationary Dueling Bandits. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023
- [4] Stochastic Bandits with Graph Feedback in Non-Stationary Environments. AAAI Conf. Artif. Intell., AAAI, 2021: 8758 - 8766
- [5] Stochastic Bandits with Graph Feedback in Non-Stationary Environments. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35: 8758 - 8766
- [6] Randomized Exploration for Non-Stationary Stochastic Linear Bandits. CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124: 71 - 80
- [7] Reward Attack on Stochastic Bandits with Non-stationary Rewards. FIFTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, IEEECONF, 2023: 1387 - 1393
- [8] Non-Stationary Bandits under Recharging Payoffs: Improved Planning with Sublinear Regret. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022
- [9] Some algorithms for correlated bandits with non-stationary rewards: Regret bounds and applications. PROCEEDINGS OF THE THIRD ACM IKDD CONFERENCE ON DATA SCIENCES (CODS), 2016