50 entries in total
- [35] The Variance of Discounted Rewards in Markov Decision Processes: Laurent Expansion and Sensitive Optimality. Mathematical Methods in Economics (MME 2014), 2014: 908-913
- [37] Approximating the Markov Property in Markov Decision Processes. Information and Decision Technologies, 1989, 15(3): 147-162