共 50 条
- [1] Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [3] Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [4] Provably Efficient Offline Reinforcement Learning in Regular Decision Processes ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [5] Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [6] Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [7] On Efficient Sampling in Offline Reinforcement Learning 2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1 - 6
- [8] Provably Safe Reinforcement Learning with Step-wise Violation Constraints ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [9] Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [10] Provably Efficient Exploration for Reinforcement Learning Using Unsupervised Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33