50 entries in total
- [3] On the Convergence of Temporal-Difference Learning with Linear Function Approximation. Machine Learning, 2001, 42: 241-267
- [4] Finite-Time Performance of Distributed Temporal-Difference Learning with Linear Function Approximation. SIAM Journal on Mathematics of Data Science, 2021, 3(1): 298-320
- [6] A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning. Discrete Event Dynamic Systems, 2006, 16: 207-239
- [7] A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning. Discrete Event Dynamic Systems: Theory and Applications, 2006, 16(2): 207-239