共 50 条
- [41] Reinforcement learning with nonstationary reward depending on the episode 2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 2145 - 2150
- [42] Inverse Reinforcement Learning with the Average Reward Criterion ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [44] Balancing multiple sources of reward in reinforcement learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 1082 - 1088
- [46] Evolved Intrinsic Reward Functions for Reinforcement Learning PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1955 - 1956
- [49] Hindsight Reward Shaping in Deep Reinforcement Learning 2020 INTERNATIONAL SAUPEC/ROBMECH/PRASA CONFERENCE, 2020, : 653 - 659
- [50] Robust Average-Reward Reinforcement Learning Journal of Artificial Intelligence Research, 2024, 80 : 719 - 803