46 items in total
- [1] Kernel-based least squares policy iteration for reinforcement learning. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18(4): 973-992
- [4] Experience replay for least-squares policy iteration. Liu, Quan (quanliu@suda.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc. (01): 274-281
- [5] Sparse Kernel-Based Least Squares Temporal Difference with Prioritized Sweeping. NEURAL INFORMATION PROCESSING, ICONIP 2016, PT III, 2016, 9949: 221-230