共 50 条
- [1] Learning reward functions from diverse sources of human feedback: Optimally integrating demonstrations and preferences INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2022, 41 (01): : 45 - 67
- [2] Reward learning from human preferences and demonstrations in Atari ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [5] Learning Noise-Induced Reward Functions for Surpassing Demonstrations in Imitation Learning THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 7953 - 7961
- [7] Reward Learning from Narrated Demonstrations 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7004 - 7013
- [9] Model-based Adversarial Imitation Learning from Demonstrations and Human Reward 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 1683 - 1690
- [10] Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery 2024 INTERNATIONAL SYMPOSIUM ON MEDICAL ROBOTICS, ISMR 2024, 2024,