共 50 条
- [32] Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [33] Knowledge Revision for Reinforcement Learning with Abstract MDPs PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 763 - 770
- [34] Reinforcement Learning in Parametric MDPs with Exponential Families 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
- [35] Near-Optimal Interdiction of Factored MDPs CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017), 2017,
- [36] Knowledge Revision for Reinforcement Learning with Abstract MDPs AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1535 - 1536
- [37] Reinforcement Learning in Reward-Mixing MDPs ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [38] TeXDYNA: Hierarchical Reinforcement Learning in Factored MDPs FROM ANIMALS TO ANIMATS 11, 2010, 6226 : 489 - +
- [39] Safety-Constrained Reinforcement Learning for MDPs TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS (TACAS 2016), 2016, 9636 : 130 - 146