50 items in total
- [1] Regret Bounds for Information-Directed Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022
- [2] Reinforcement learning reward functions for unsupervised learning ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 397 - +
- [3] Reward Reports for Reinforcement Learning PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023, 2023, : 84 - 130
- [5] EFFICIENT AND STABLE INFORMATION DIRECTED EXPLORATION FOR CONTINUOUS REINFORCEMENT LEARNING 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4023 - 4027
- [6] Actively learning costly reward functions for reinforcement learning MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (01):
- [8] Reinforcement Learning for Data Preparation with Active Reward Learning INTERNET SCIENCE, INSCI 2019, 2019, 11938 : 121 - 132
- [9] Active Learning for Reward Estimation in Inverse Reinforcement Learning MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2009, 5782 : 31 - +
- [10] Learning Reward Machines for Partially Observable Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32