共 50 条
- [1] Copeland Dueling Bandit Problem: Regret Lower Bound, Optimal Algorithm, and Computationally Efficient Algorithm INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [2] High-dimensional Contextual Bandit Problem without Sparsity ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [3] Linear Upper Confidence Bound Algorithm for Contextual Bandit Problem with Piled Rewards ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT II, 2016, 9652 : 143 - 155
- [4] HIGH DIMENSIONAL STOCHASTIC LINEAR CONTEXTUAL BANDIT WITH MISSING COVARIATES 2022 IEEE 32ND INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2022,
- [5] Regret Lower Bound and Optimal Algorithm in Finite Stochastic Partial Monitoring ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
- [6] Efficient and Robust High-Dimensional Linear Contextual Bandits PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4259 - 4265
- [7] An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting 2020 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2020, : 2783 - 2788
- [8] Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
- [9] Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback INTERNATIONAL CONFERENCE ON MACHINE LEARNING, 2024, 235