27 entries in total
- [1] [Anonymous], 2009, Stochastic approximation: A dynamical systems viewpoint
- [2] [Anonymous], 2007, Advances in Neural Information Processing Systems
- [3] Baras J. S., 1997, J. Math. Systems, Estimation & Control, V7, P371
- [4] Q-learning for risk-sensitive control, 2002, Mathematics of Operations Research, V27(2), P294-311
- [6] Gu SX, 2016, PR MACH LEARN RES, V48
- [9] Konda VR, 2000, ADV NEUR IN, V12, P1008
- [10] La P., 2013, ADV NEURAL INFORM PR, V26