共 50 条
- [33] Adaptive aggregation for reinforcement learning in average reward Markov decision processes Annals of Operations Research, 2013, 208 : 321 - 336
- [34] Average Reward Reinforcement Learning for Semi-Markov Decision Processes NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 768 - 777
- [37] Optimization of Parametric Policies of Markov Decision Processes under a Variance Criterion 2016 13TH INTERNATIONAL WORKSHOP ON DISCRETE EVENT SYSTEMS (WODES), 2016, : 332 - 337
- [38] Game Theoretic Markov Decision Processes for Optimal Decision Making in Social Systems 2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2014, : 268 - 272
- [39] Perceptive evaluation for the optimal discounted reward in Markov decision processes MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2005, 3558 : 283 - 293