共 50 条
- [45] Episodic task learning in Markov decision processes Artificial Intelligence Review, 2011, 36 : 87 - 98
- [48] Variance minimization of parameterized Markov decision processes Discrete Event Dynamic Systems, 2018, 28 : 63 - 81
- [50] Ranking policies in discrete Markov decision processes Annals of Mathematics and Artificial Intelligence, 2010, 59 : 107 - 123