Machine learning from crowds: A systematic review of its applications

被引:9
作者
Rodrigo, Enrique G. [1 ]
Aledo, Juan A. [2 ]
Gamez, Jose A. [1 ]
机构
[1] Univ Castilla La Mancha, Dept Comp Syst, Campus Univ S-N, Albacete 02071, Spain
[2] Univ Castilla La Mancha, Dept Math, Albacete, Spain
关键词
applications; crowdsourcing; labeler quality; machine learning; multiple annotation; TRUTH DISCOVERY; CLASSIFICATION; NETWORKS; LABELS; AGGREGATION; RECOGNITION; ANNOTATION; INFERENCE; BEHAVIOR; QUALITY;
D O I
10.1002/widm.1288
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowdsourcing opens the door to solving a wide variety of problems that previously were unfeasible in the field of machine learning, allowing us to obtain relatively low cost labeled data in a small amount of time. However, due to the uncertain quality of labelers, the data to deal with are sometimes unreliable, forcing practitioners to collect information redundantly, which poses new challenges in the field. Despite these difficulties, many applications of machine learning using crowdsourced data have recently been published that achieved state of the art results in relevant problems. We have analyzed these applications following a systematic methodology, classifying them into different fields of study, highlighting several of their characteristics and showing the recent interest in the use of crowdsourcing for machine learning. We also identify several exciting research lines based on the problems that remain unsolved to foster future research in this field.
引用
收藏
页数:23
相关论文
共 168 条
[91]   A survey of the use of crowdsourcing in software engineering [J].
Mao, Ke ;
Capra, Licia ;
Harman, Mark ;
Jia, Yue .
JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 126 :57-84
[92]   A Mathematical Framework for Combining Decisions of Multiple Experts toward Accurate and Remote Diagnosis of Malaria Using Tele-Microscopy [J].
Mavandadi, Sam ;
Feng, Steve ;
Yu, Frank ;
Dimitrov, Stoyan ;
Nielsen-Saines, Karin ;
Prescott, William R. ;
Ozcan, Aydogan .
PLOS ONE, 2012, 7 (10)
[93]   Classifying GABAergic interneurons with semi-supervised projected model-based clustering [J].
Mihaljevic, Bojan ;
Benavides-Piccione, Ruth ;
Guerra, Luis ;
DeFelipe, Javier ;
Larranaga, Pedro ;
Bielza, Concha .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2015, 65 (01) :49-59
[94]   Efficient Crowd-Powered Active Learning for Reliable Review Evaluation [J].
Min, Xinping ;
Shi, Yuliang ;
Cui, Lizhen ;
Yu, Han ;
Miao, Yuan .
PROCEEDINGS OF 2017 2ND INTERNATIONAL CONFERENCE ON CROWD SCIENCE AND ENGINEERING ICCSE 2017, 2017, :136-143
[95]   Crowdsourcing as a Novel Technique for Retinal Fundus Photography Classification: Analysis of Images in the EPIC Norfolk Cohort on Behalf of the UKBiobank Eye and Vision Consortium [J].
Mitry, Danny ;
Peto, Tunde ;
Hayat, Shabina ;
Morgan, James E. ;
Khaw, Kay-Tee ;
Foster, Paul J. .
PLOS ONE, 2013, 8 (08)
[96]   Detecting Low-Quality Workers in QoE Crowdtesting: A Worker Behavior-Based Approach [J].
Mok, Ricky K. P. ;
Chang, Rocky K. C. ;
Li, Weichao .
IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (03) :530-543
[97]   Crowdsourcing for climate and atmospheric sciences: current status and future potential [J].
Muller, C. L. ;
Chapman, L. ;
Johnston, S. ;
Kidd, C. ;
Illingworth, S. ;
Foody, G. ;
Overeem, A. ;
Leigh, R. R. .
INTERNATIONAL JOURNAL OF CLIMATOLOGY, 2015, 35 (11) :3185-3203
[98]   Human Activity Recognition by Combining a Small Number of Classifiers [J].
Nazabal, Alfredo ;
Garcia-Moreno, Pablo ;
Artes-Rodriguez, Antonio ;
Ghahramani, Zoubin .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2016, 20 (05) :1342-1351
[99]   Source code fragment summarization with small-scale crowdsourcing based features [J].
Nazar, Najam ;
Jiang, He ;
Gao, Guojun ;
Zhang, Tao ;
Li, Xiaochen ;
Ren, Zhilei .
FRONTIERS OF COMPUTER SCIENCE, 2016, 10 (03) :504-517
[100]  
Ng Jun-Ping., 2012, P COLING 2012, P2109