50 items in total
- [3] Finite-Sample Regret Bound for Distributionally Robust Offline Tabular Reinforcement Learning. 24th International Conference on Artificial Intelligence and Statistics (AISTATS), 2021, 130.
- [4] Path Planning Using Wasserstein Distributionally Robust Deep Q-learning. 2023 European Control Conference, ECC, 2023.
- [5] Sample Complexity of Kernel-Based Q-Learning. International Conference on Artificial Intelligence and Statistics, Vol. 206, 2023, 206: 453-469.
- [6] Tightening the Dependence on Horizon in the Sample Complexity of Q-Learning. International Conference on Machine Learning, Vol. 139, 2021, 139.
- [7] The Sample Complexity of Teaching-by-Reinforcement on Q-Learning. Thirty-Fifth AAAI Conference on Artificial Intelligence, Thirty-Third Conference on Innovative Applications of Artificial Intelligence and the Eleventh Symposium on Educational Advances in Artificial Intelligence, 2021, 35: 10939-10947.
- [10] Sample Complexity of Decentralized Tabular Q-Learning for Stochastic Games. 2023 American Control Conference, ACC, 2023: 1098-1103.