共 50 条
- [1] Reinforcement learning based algorithms for average cost Markov Decision Processes DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2007, 17 (01): : 23 - 52
- [3] Adaptive aggregation for reinforcement learning in average reward Markov decision processes Annals of Operations Research, 2013, 208 : 321 - 336
- [4] Average Reward Reinforcement Learning for Semi-Markov Decision Processes NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 768 - 777
- [6] Reinforcement Learning for Cost-Aware Markov Decision Processes INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [7] Reinforcement Learning Algorithms for Regret Minimization in Structured Markov Decision Processes AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1289 - 1290
- [8] A reinforcement learning based algorithm for Markov decision processes 2005 International Conference on Intelligent Sensing and Information Processing, Proceedings, 2005, : 199 - 204
- [9] RVI Reinforcement Learning for Semi-Markov Decision Processes with Average Reward 2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 1674 - 1679