35 entries in total
[1]
Error bounds for constant step-size Q-learning [J]. SYSTEMS & CONTROL LETTERS, 2012, 61 (12): 1203-1208
[2]
Bellemare MG, 2017, PR MACH LEARN RES, V70
[4]
Chen C.-T., 1995, Linear System Theory and Design
[5]
Chen ZW, 2021, Arxiv, DOI arXiv:2102.01567
[6]
Devraj Adithya M., 2017, Proc. of the Intl. Conference on Neural Information Processing Systems, P2232
[7]
Even-Dar E, 2003, J MACH LEARN RES, V5, P1
[8]
Ghavamzadeh Mohammad, 2011, ADV NEURAL INFORM PR, P2411
[10]
Heess N, 2015, Arxiv, DOI arXiv:1512.04455