共 50 条
- [23] On the Existence of Fixed Points for Approximate Value Iteration and Temporal-Difference Learning Journal of Optimization Theory and Applications, 2000, 105 : 589 - 608
- [25] On Average Versus Discounted Reward Temporal-Difference Learning Machine Learning, 2002, 49 : 179 - 191
- [28] Distributed multi-agent temporal-difference learning with full neighbor information Control Theory and Technology, 2020, 18 : 379 - 389
- [29] Asymptotic analysis of temporal-difference learning algorithms with constant step-sizes Machine Learning, 2006, 63 : 107 - 133