50 items in total
- [3] Constrained regret minimization for multi-criterion multi-armed bandits. Machine Learning, 2023, 112: 431-458
- [4] Lenient Regret for Multi-Armed Bandits. Thirty-Fifth AAAI Conference on Artificial Intelligence, Thirty-Third Conference on Innovative Applications of Artificial Intelligence and the Eleventh Symposium on Educational Advances in Artificial Intelligence, 2021, 35: 8950-8957
- [5] Fairness and Welfare Quantification for Regret in Multi-Armed Bandits. Thirty-Seventh AAAI Conference on Artificial Intelligence, Vol 37 No 6, 2023: 6762-6769
- [6] Bounded Regret for Finitely Parameterized Multi-Armed Bandits. IEEE Control Systems Letters, 2021, 5(3): 1073-1078
- [7] Individual Regret in Cooperative Nonstochastic Multi-Armed Bandits. Advances in Neural Information Processing Systems 32 (NIPS 2019), 2019, 32
- [8] Strategies for Safe Multi-Armed Bandits with Logarithmic Regret and Risk. International Conference on Machine Learning, Vol 162, 2022
- [9] Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits. International Conference on Machine Learning, Vol 202, 2023
- [10] Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory. Thirty-Fourth AAAI Conference on Artificial Intelligence, the Thirty-Second Innovative Applications of Artificial Intelligence Conference and the Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, 2020, 34: 10085-10092