50 entries in total
- [32] On the Existence of Fixed Points for Approximate Value Iteration and Temporal-Difference Learning. Journal of Optimization Theory and Applications, 2000, 105: 589-608
- [34] Distributed multi-agent temporal-difference learning with full neighbor information. Control Theory and Technology, 2020, 18: 379-389
- [38] On sharpness of error bounds for multivariate neural network approximation. Ricerche di Matematica, 2022, 71: 633-653
- [40] Temporal-difference emphasis learning with regularized correction for off-policy evaluation and control. Applied Intelligence, 2023, 53: 20917-20937