共 50 条
- [1] Bias-Corrected Q-Learning to Control Max-Operator Bias in Q-Learning PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2013, : 93 - 99
- [3] On the Estimation Bias in Double Q-Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [4] Bias-corrected bootstrap and model uncertainty ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 521 - 528