共 50 条
- [21] Active Learning for Reward Estimation in Inverse Reinforcement Learning MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2009, 5782 : 31 - +
- [22] Learning Reward Machines for Partially Observable Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [24] Direct reward and indirect reward in multi-agent reinforcement learning ROBOCUP 2002: ROBOT SOCCER WORLD CUP VI, 2003, 2752 : 359 - 366
- [25] Reinforcement Learning with Reward Shaping and Hybrid Exploration in Sparse Reward Scenes 2023 IEEE 6TH INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER-PHYSICAL SYSTEMS, ICPS, 2023,
- [27] Reward Certification for Policy Smoothed Reinforcement Learning THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21429 - 21437
- [28] Direct reward and indirect reward in multi-agent reinforcement learning Ohta, M. (ohta@carc.aist.go.jp), (Springer Verlag):
- [29] A Modified Average Reward Reinforcement Learning Based on Fuzzy Reward Function IMECS 2009: INTERNATIONAL MULTI-CONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2009, : 113 - 117
- [30] Reinforcement Learning in Reward-Mixing MDPs ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34