共 50 条
- [1] Asymptotic analysis of temporal-difference learning algorithms with constant step-sizes Machine Learning, 2006, 63 : 107 - 133
- [2] On the asymptotic behavior of a constant stepsize temporal-difference learning algorithm COMPUTATIONAL LEARNING THEORY, 1999, 1572 : 126 - 137
- [6] A Non-asymptotic Analysis of Non-parametric Temporal-Difference Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [7] Temporal-Difference Learning for Online Reachability Analysis 2015 EUROPEAN CONTROL CONFERENCE (ECC), 2015, : 2508 - 2513
- [9] Analysis of temporal-difference learning with function approximation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 9: PROCEEDINGS OF THE 1996 CONFERENCE, 1997, 9 : 1075 - 1081
- [10] On the mean-square rate of convergence of temporal-difference learning algorithms PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 1454 - 1459