共 31 条
- [1] [Anonymous], 2003, J. Mach. Learn. Res.
- [2] [Anonymous], 2015, Reinforcement Learning: An Introduction
- [3] [Anonymous], 2010, Algorithms for Reinforcement Learning
- [4] [Anonymous], 1995, Dynamic Programming and Optimal Control
- [6] Barto A.G., 1988, NEURONLIKE ADAPTIVE, P834
- [7] Approximate policy iteration: A survey and some new methods [J]. Journal of Control Theory and Applications, 2011, 9 (3): : 310 - 335
- [8] Bradtke SJ, 1996, MACH LEARN, V22, P33, DOI 10.1007/BF00114723
- [9] Busoniu L, 2010, AUTOM CONTROL ENG SE, P1, DOI 10.1201/9781439821091-f
- [10] Dann C, 2014, J MACH LEARN RES, V15, P809