共 23 条
- [1] [Anonymous], 1992, Stochastic Stability of Markov chains
- [3] Bertsekas D. P., 1996, Neuro Dynamic Programming, V1st
- [4] Bertsekas DP, 1995, Dynamic Programming and Optimal Control, V2
- [5] The relations among potentials, perturbation analysis, and Markov decision processes [J]. DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 1998, 8 (01): : 71 - 87
- [8] From perturbation analysis to Markov decision processes and reinforcement learning [J]. DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2003, 13 (1-2): : 9 - 39
- [10] A time aggregation approach to Markov decision processes [J]. AUTOMATICA, 2002, 38 (06) : 929 - 943