共 23 条
[1]
Adibi A, 2024, PR MACH LEARN RES, V238
[2]
[Anonymous], 2008, Stochastic Approximation: A Dynamical Systems View-point
[3]
Arjevani Y, 2020, PR MACH LEARN RES, V117, P111
[4]
Error bounds for constant step-size Q-learning
[J].
SYSTEMS & CONTROL LETTERS,
2012, 61 (12)
:1203-1208
[5]
Bhandari J., 2018, C LEARN THEOR COLT, P1691
[6]
Borkar V, 2024, Arxiv, DOI arXiv:2110.14427
[7]
Chen Z., 2019, PREPRINT
[8]
Dalal G, 2018, AAAI CONF ARTIF INTE, P6144
[10]
Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant Stepsizes
[J].
Performance Evaluation Review,
2023, 51 (01)
:81-82