共 42 条
[1]
Abed-alguni B. H., 2018, Int. J. Artif. Intell, V16, P41
[2]
[Anonymous], 1996, NEURODYNAMIC PROGRAM
[3]
[Anonymous], 1995, Machine Learning Proceedings 1995
[5]
Error bounds for constant step-size Q-learning
[J].
SYSTEMS & CONTROL LETTERS,
2012, 61 (12)
:1203-1208
[6]
Bhandari J., 2018, C LEARN THEOR COLT, P1691
[8]
Cai Q, 2019, 33 C NEURAL INFORM P, V32
[9]
Carvalho D.., 2020, Advances in Neural Information Processing Systems
[10]
Chen Z., 2021, LYAPUNOV THEORY FINI