共 50 条
- [11] A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning Discrete Event Dynamic Systems, 2006, 16 : 207 - 239
- [12] A generalized Kalman filter for fixed point approximation and efficient temporal-difference learning DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2006, 16 (02): : 207 - 239
- [13] A Non-asymptotic Analysis of Non-parametric Temporal-Difference Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [14] IMPROVING REINFORCEMENT LEARNING USING TEMPORAL-DIFFERENCE NETWORK EUROCON2009 EUROCON 2009: INTERNATIONAL IEEE CONFERENCE DEVOTED TO THE 150 ANNIVERSARY OF ALEXANDER S. POPOV, VOLS 1- 4, PROCEEDINGS, 2009, : 1716 - 1722
- [18] Multi-Agent Temporal-Difference Learning with Linear Function Approximation: Weak Convergence under Time-Varying Network Topologies 2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 167 - 172
- [19] On the asymptotic behavior of a constant stepsize temporal-difference learning algorithm COMPUTATIONAL LEARNING THEORY, 1999, 1572 : 126 - 137
- [20] Asymptotic analysis of temporal-difference learning algorithms with constant step-sizes Machine Learning, 2006, 63 : 107 - 133