50 entries in total
[21] Active Preference-Based Learning of Reward Functions [J]. ROBOTICS: SCIENCE AND SYSTEMS XIII, 2017.
[22] Learning solution similarity in preference-based CBR [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8765: 17-31.
[23] Versatile Dueling Bandits: Best-of-both World Analyses for Online Learning from Relative Preferences [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022: 19011-19026.
[24] Inverse Preference Learning: Preference-based RL without a Reward Function [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023.
[25] Online Certification of Preference-Based Fairness for Personalized Recommender Systems [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022: 6532-6540.
[26] Online Rank Elicitation for Plackett-Luce: A Dueling Bandits Approach [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28.
[27] A Generalized Acquisition Function for Preference-based Reward Learning [J]. 2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024: 2814-2821.
[28] Model-Free Preference-Based Reinforcement Learning [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016: 2222-2228.
[29] Embedding Learning for Preference-based Speech Quality Assessment [J]. INTERSPEECH 2024, 2024: 2685-2689.
[30] Learning to Identify Top Elo Ratings: A Dueling Bandits Approach [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022: 8797-8805.