共 50 条
- [1] On the Estimation Bias in Double Q-Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [2] Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 475 - 483
- [3] A controlling estimation bias method: Max_Mix_Min estimator for Q-learning JOURNAL OF SUPERCOMPUTING, 2024, 80 (13): : 19248 - 19273
- [5] Fuzzy Q-Learning for generalization of reinforcement learning FUZZ-IEEE '96 - PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 1996, : 2208 - 2214
- [6] Deep Reinforcement Learning with Double Q-Learning THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2094 - 2100
- [7] Reinforcement learning guidance law of Q-learning Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (02): : 414 - 419
- [8] Bias-Corrected Q-Learning to Control Max-Operator Bias in Q-Learning PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2013, : 93 - 99
- [9] Feasible Q-Learning for Average Reward Reinforcement Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [10] Mildly Conservative Q-Learning for Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,