共 20 条
[1]
Error bounds for constant step-size Q-learning
[J].
SYSTEMS & CONTROL LETTERS,
2012, 61 (12)
:1203-1208
[3]
Chen ZW, 2021, Arxiv, DOI arXiv:2102.01567
[4]
Even-Dar E, 2003, J MACH LEARN RES, V5, P1
[5]
Ghavamzadeh Mohammad, 2011, Advances in Neural Information Processing Systems, V24, P2411
[7]
Hasselt H., 2010, Advances in neural information processing systems, V23
[8]
Wainwright MJ, 2019, Arxiv, DOI arXiv:1905.06265