35 entries in total
[2]
[Anonymous], 1989, LEARNING DELAYED REW
[3]
[Anonymous], 2015, Reinforcement Learning: An Introduction
[8]
Darken C., 1992, Neural Networks for Signal Processing II. Proceedings of the IEEE-SP Workshop (Cat. No.92TH0430-9), P3, DOI 10.1109/NNSP.1992.253713