Preference-based online learning with dueling bandits: A survey

被引:0
作者
Bengs, Viktor [1 ]
Busa-Fekete, Robert [2 ]
Mesaoudi-Paul, Adil El [1 ]
Hullermeier, Eyke [1 ]
机构
[1] Heinz Nixdorf Institute, Department of Computer Science, Paderborn University, Germany
[2] Google Research, New York,NY, United States
关键词
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
[41]   Active Preference-Based Gaussian Process Regression for Reward Learning [J].
Biyik, Lirdem ;
Huynh, Nicolas ;
Kochenderfer, Mykel J. ;
Sadigh, Dorsa .
ROBOTICS: SCIENCE AND SYSTEMS XVI, 2020,
[42]   Listwise Reward Estimation for Offline Preference-based Reinforcement Learning [J].
Choi, Heewoong ;
Jung, Sangwon ;
Ahn, Hongjoon ;
Moon, Taesup .
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, 2024, 235
[43]   Preference-based Reinforcement Learning with Finite-Time Guarantees [J].
Xu, Yichong ;
Wang, Ruosong ;
Yang, Lin F. ;
Singh, Aarti ;
Dubrawski, Artur .
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[44]   RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences [J].
Cheng, Jie ;
Xiong, Gang ;
Dai, Xingyuan ;
Miao, Qinghai ;
Lv, Yisheng ;
Wang, Fei-Yue .
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, 2024, 235
[45]   Preference-based valuation of treatment attributes in haemophilia A using web survey [J].
Carlsson, K. Steen ;
Andersson, E. ;
Berntorp, E. .
HAEMOPHILIA, 2017, 23 (06) :894-903
[46]   Preference-based belief operators [J].
Asheim, GB ;
Sovik, Y .
MATHEMATICAL SOCIAL SCIENCES, 2005, 50 (01) :61-82
[47]   Online Certification of Preference-Based Fairness for Personalized Recommender Systems (Extended Abstract) [J].
Do, Virginie ;
Corbett-Davies, Sam ;
Atif, Jamal ;
Usunier, Nicolas .
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, :6426-6430
[48]   Applying Preference-based Customization [J].
Liaskos, Sotirios ;
Rogozhkin, Vyacheslav .
2011 19TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE), 2011, :353-+
[49]   Preference-Based Offline Evaluation [J].
Clarke, Charles L. A. ;
Diaz, Fernando ;
Arabzadeh, Negar .
PROCEEDINGS OF THE SIXTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2023, VOL 1, 2023, :1248-1251
[50]   Preference-Based Trajectory Generation [J].
Lennon, Jamie A. ;
Atkins, Ella M. .
JOURNAL OF AEROSPACE COMPUTING INFORMATION AND COMMUNICATION, 2009, 6 (03) :142-170