共 20 条
- [2] Linear Convergence of Independent Natural Policy Gradient in Games With Entropy Regularization IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 1217 - 1222
- [6] Generalized Compatible Function Approximation for Policy Gradient Search NEURAL INFORMATION PROCESSING, ICONIP 2016, PT I, 2016, 9947 : 615 - 622
- [9] On the Convergence of Temporal-Difference Learning with Linear Function Approximation Machine Learning, 2001, 42 : 241 - 267
- [10] A policy gradient reinforcement learning algorithm with fuzzy function approximation IEEE ROBIO 2004: Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2004, : 936 - 940