44 entries in total
- [1] [Anonymous], 1996, Neuro-dynamic programming
- [2] Cao X.R., 2007, DISCRETE EVENT DYN S, V15, P169
- [3] Cao Xi-Ren, 2007, STOCHASTIC LEARNING
- [7] From perturbation analysis to Markov decision processes and reinforcement learning [J]. DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2003, 13 (1-2): 9-39
- [10] CAO XR, 1994, REALIZATION PROBABIL