54 entries in total
[1]
Agarwal Alekh, 2019, ARXIV190603804
[2]
[Anonymous], 2011, Technical report
[3]
[Anonymous], 2019, ADV NEURAL INFORM PR, DOI 10.1109/PIMRC.2019.8904340
[4]
[Anonymous], 2013, ADV NEURAL INFORM PR
[6]
Bai Y, 2019, ADV NEUR IN, V32
[7]
Error bounds for constant step-size Q-learning [J]. SYSTEMS & CONTROL LETTERS, 2012, 61 (12): 1203-1208
[8]
Bhandari Jalaj, 2018, C LEARN THEOR COLT, P1691
[10]
Cai Q., 2019, Advances in Neural Information Processing Systems, P11312