A Crowdsourcing Worker Quality Evaluation Algorithm on MapReduce for Big Data Applications

被引:16
作者
Dang, Depeng [1 ]
Liu, Ying [1 ]
Zhang, Xiaoran [1 ]
Huang, Shihang [1 ]
机构
[1] Beijing Normal Univ, Coll Informat Sci & Technol, Beijing 100875, Peoples R China
基金
中国国家自然科学基金; 国家教育部科学基金资助;
关键词
Crowdsourcing systems; quality control; big data; mapreduce; hadoop; SYSTEMS;
D O I
10.1109/TPDS.2015.2457924
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Crowdsourcing is a new emerging distributed computing and business model on the backdrop of Internet blossoming. With the development of crowdsourcing systems, the data size of crowdsourcers, contractors and tasks grows rapidly. The worker quality evaluation based on big data analysis technology has become a critical challenge. This paper first proposes a general worker quality evaluation algorithm that is applied to any critical tasks such as tagging, matching, filtering, categorization and many other emerging applications, without wasting resources. Second, we realize the evaluation algorithm in the Hadoop platform using the MapReduce parallel programming model. Finally, to effectively verify the accuracy and the effectiveness of the algorithm in a wide variety of big data scenarios, we conduct a series of experiments. The experimental results demonstrate that the proposed algorithm is accurate and effective. It has high computing performance and horizontal scalability. And it is suitable for large-scale worker quality evaluations in a big data environment.
引用
收藏
页码:1879 / 1888
页数:10
相关论文
共 47 条
[1]   Quality Control in Crowdsourcing Systems Issues and Directions [J].
Allahbakhsh, Mohammad ;
Benatallah, Boualem ;
Ignjatovic, Aleksandar ;
Motahari-Nezhad, Hamid Reza ;
Bertino, Elisa ;
Dustdar, Schahram .
IEEE INTERNET COMPUTING, 2013, 17 (02) :76-81
[2]  
[Anonymous], 2009, Advances in Neural Information Processing Systems
[3]  
[Anonymous], 2009, Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing
[4]  
[Anonymous], 2012, P 11 INT C AUT AG MU
[5]  
[Anonymous], 2011, BIG DATA NEXT FRONTI
[6]  
[Anonymous], 1979, J R STAT SOC C-APPL, DOI 10.2307/2346806
[7]  
[Anonymous], 2006, WIRED MAG
[8]  
[Anonymous], 2010, Proceedings of the ACM SIGKDD Workshop on Human Computation, noeth, DOI [10.1145/1837885.1837906, DOI 10.1145/1837885.1837906]
[9]  
Brabham DC., 2008, CONVERGENCE-US, V14, P75, DOI [10.1177/1354856507084420, DOI 10.1177/1354856507084420]
[10]   Using Crowdsourcing and Active Learning to Track Sentiment in Online Media [J].
Brew, Anthony ;
Greene, Derek ;
Cunningham, Padraig .
ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 :145-150