Machine learning from crowds: A systematic review of its applications

被引:9
作者
Rodrigo, Enrique G. [1 ]
Aledo, Juan A. [2 ]
Gamez, Jose A. [1 ]
机构
[1] Univ Castilla La Mancha, Dept Comp Syst, Campus Univ S-N, Albacete 02071, Spain
[2] Univ Castilla La Mancha, Dept Math, Albacete, Spain
关键词
applications; crowdsourcing; labeler quality; machine learning; multiple annotation; TRUTH DISCOVERY; CLASSIFICATION; NETWORKS; LABELS; AGGREGATION; RECOGNITION; ANNOTATION; INFERENCE; BEHAVIOR; QUALITY;
D O I
10.1002/widm.1288
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowdsourcing opens the door to solving a wide variety of problems that previously were unfeasible in the field of machine learning, allowing us to obtain relatively low cost labeled data in a small amount of time. However, due to the uncertain quality of labelers, the data to deal with are sometimes unreliable, forcing practitioners to collect information redundantly, which poses new challenges in the field. Despite these difficulties, many applications of machine learning using crowdsourced data have recently been published that achieved state of the art results in relevant problems. We have analyzed these applications following a systematic methodology, classifying them into different fields of study, highlighting several of their characteristics and showing the recent interest in the use of crowdsourcing for machine learning. We also identify several exciting research lines based on the problems that remain unsolved to foster future research in this field.
引用
收藏
页数:23
相关论文
共 168 条
[1]   Human Activity Analysis: A Review [J].
Aggarwal, J. K. ;
Ryoo, M. S. .
ACM COMPUTING SURVEYS, 2011, 43 (03)
[2]   AggNet: Deep Learning From Crowds for Mitosis Detection in Breast Cancer Histology Images [J].
Albarqouni, Shadi ;
Baur, Christoph ;
Achilles, Felix ;
Belagiannis, Vasileios ;
Demirci, Stefanie ;
Navab, Nassir .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (05) :1313-1321
[3]  
Alshutayri A., 2018, NATURAL LANGUAGE SPE, P1
[4]   Crowdsourcing Malaria Parasite Quantification: An Online Game for Analyzing Images of Infected Thick Blood Smears [J].
Angel Luengo-Oroz, Miguel ;
Arranz, Asier ;
Frean, John .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2012, 14 (06) :207-219
[5]  
[Anonymous], 2009, Advances in Neural Information Processing Systems
[6]  
[Anonymous], 2017, SCIMAGO J COUNTRY RA
[7]  
[Anonymous], 2012, 21st ACM International Conference on Information and Knowledge Management, CIKM'12, Maui, HI, USA, October 29 - November 02, 2012
[8]  
[Anonymous], 2011, PROC 28 INT C MACH L
[9]   Analysis of Underlying Causes of Inter-expert Disagreement in Retinopathy of Prematurity Diagnosis Application of Machine Learning Principles [J].
Ataer-Cansizoglu, E. ;
Kalpathy-Cramer, J. ;
You, S. ;
Keck, K. ;
Erdogmus, D. ;
Chiang, M. F. .
METHODS OF INFORMATION IN MEDICINE, 2015, 54 (01) :93-102
[10]   Harnessing Label Uncertainty to Improve Modeling: An Application to Student Engagement Recognition [J].
Aung, Arkar Min ;
Whitehill, Jacob R. .
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :166-170