共 50 条
- [1] Conservative Offline Distributional Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [2] Conservative Offline Distributional Reinforcement Learning Advances in Neural Information Processing Systems, 2021, 23 : 19235 - 19247
- [3] Offline Quantum Reinforcement Learning in a Conservative Manner THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 7148 - 7156
- [4] Adaptable Conservative Q-Learning for Offline Reinforcement Learning PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 200 - 212
- [5] Mildly Conservative Q-Learning for Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [6] Conservative State Value Estimation for Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [7] RORL: Robust Offline Reinforcement Learning via Conservative Smoothing ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [8] Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [9] OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 15897 - 15905
- [10] VOCE: Variational Optimization with Conservative Estimation for Offline Safe Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,