共 50 条
- [22] Belief Function Model for Reliable Optimal Set Estimation of Transition Matrices in Discounted Infinite-Horizon Markov Decision Processes 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 1214 - 1221
- [24] Approximate robust policy iteration for discounted infinite-horizon Markov decision processes with uncertain stationary parametric tiransition matrices 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 2052 - 2057
- [25] Semi-infinite discounted Markov decision processes: Policy improvement and singular perturbations Mathematical Methods of Operations Research, 2001, 54 : 279 - 290
- [27] Improved Sample Complexity Analysis of Natural Policy Gradient Algorithm with General Parameterization for Infinite Horizon Discounted Reward Markov Decision Processes INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [30] Discounted Markov decision processes with fuzzy costs Annals of Operations Research, 2020, 295 : 769 - 786