共 50 条
- [21] Faster Non-asymptotic Convergence for Double Q-learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [22] On Q-learning Convergence for Non-Markov Decision Processes PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2546 - 2552
- [24] Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 475 - 483
- [25] An algorithm that excavates suboptimal states and improves Q-learning ENGINEERING RESEARCH EXPRESS, 2024, 6 (04):
- [28] Finite-sample convergence rates for Q-learning and indirect algorithms ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 996 - 1002
- [29] Safe Q-learning for continuous-time linear systems 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 241 - 246