50 entries in total
- [4] On the Convergence of Temporal-Difference Learning with Linear Function Approximation. Machine Learning, 2001, 42: 241-267
- [6] Temporal-Difference Learning with Nonlinear Function Approximation: Lazy Training and Mean Field Regimes. Mathematical and Scientific Machine Learning, 2021, 145: 37-74
- [7] Finite-Time Performance of Distributed Temporal-Difference Learning with Linear Function Approximation. SIAM Journal on Mathematics of Data Science, 2021, 3(1): 298-320