Dynamic Estimation of Rater Reliability in Subjective Tasks Using Multi-Armed Bandits

被引:0
作者
Tarasov, Alexey [1 ]
Delany, Sarah Jane [1 ]
Mac Namee, Brian [1 ]
机构
[1] Dublin Inst Technol, Sch Comp, Dublin, Ireland
来源
Proceedings of 2012 ASE/IEEE International Conference on Privacy, Security, Risk and Trust and 2012 ASE/IEEE International Conference on Social Computing (SocialCom/PASSAT 2012) | 2012年
基金
爱尔兰科学基金会;
关键词
human computation; crowdsourcing; multi-armed bandits; learning from multiple sources; training data; supervised machine learning;
D O I
10.1109/SocialCom-PASSAT.2012.50
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many application areas that use supervised machine learning make use of multiple raters to collect target ratings for training data. Usage of multiple raters, however, inevitably introduces the risk that a proportion of them will be unreliable. The presence of unreliable raters can prolong the rating process, make it more expensive and lead to inaccurate ratings. The dominant, "static" approach of solving this problem in state-of-the-art research is to estimate the rater reliability and to calculate the target ratings when all ratings have been gathered. However, doing it dynamically while raters rate training data can make the acquisition of ratings faster and cheaper compared to static techniques. We propose to cast the problem of the dynamic estimation of rater reliability as a multi-armed bandit problem. Experiments show that the usage of multi-armed bandits for this problem is worthwhile, providing that each rater can rate any asset when asked. The purpose of this paper is to outline the directions of future research in this area.
引用
收藏
页码:979 / 980
页数:2
相关论文
共 8 条
[1]  
[Anonymous], P NIPS 94
[2]  
[Anonymous], P HCOMP
[3]  
Cholleti S., 2008, TECH REP
[4]  
Donmez P., 2009, P KDD
[5]  
Law E., 2011, Human Computation
[6]  
Raykar VC, 2010, J MACH LEARN RES, V11, P1297
[7]  
Snel J., 2012, 4 INT WORKSH CORP RE, P72
[8]  
Tarasov A., 2012, P MLHCC WORKSH ICML