共 50 条
- [3] Value Iteration and Action ε-Approximation of Optimal Policies in Discounted Markov Decision Processes RECENT ADVANCES IN APPLIED MATHEMATICS, 2009, : 213 - +
- [6] Adaptive control for discrete-time Markov processes with unbounded costs: Average criterion Mathematical Methods of Operations Research, 1998, 48 : 37 - 55