DLTA: A Framework for Dynamic Crowdsourcing Classification Tasks

被引：6

作者：

Zheng, Libin ^{[1
]}

Chen, Lei ^{[1
]}

机构：

[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2019年 / 31卷 / 05期

基金：

美国国家科学基金会;

关键词：

Classification crowdsourcing; quality control; label inference; label acquisition;

D O I：

10.1109/TKDE.2018.2849385

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The increasing popularity of crowdsourcing markets enables the application of crowdsourcing classification tasks. How to conduct quality control in such an application to achieve accurate classification results from noisy workers is an important and challenging task, and has drawn broad research interests. However, most existing works do not exploit the label acquisition phase, which results in their disability of making a proper budget allocation. Moreover, some works impractically make the assumption of managing workers, which is not supported by common crowdsourcing platforms such as AMT or CrowdFlower. To overcome these drawbacks, in this paper, we devise a Dynamic Label Acquisition and Answer Aggregation (DLTA) framework for crowdsourcing classification tasks. The framework proceeds in a sequence of rounds, adaptively conducting label inference and label acquisition. In each round, it analyzes the collected answers of previous rounds to perform proper budget allocation, and then issues the resultant query to the crowd. To support DLTA, we propose a generative model for the collection of labels, and correspondingly strategies for label inference and budget allocation. Experimental results show that compared with existing methods, DLTA obtains competitive accuracy in the binary case. Besides, its extended version, which plugs in the state-of-the-art inference technique, achieves the highest accuracy.

引用

页码：867 / 879

页数：13

共 36 条

[1]

[Anonymous], 2009, Advances in Neural Information Processing Systems

[2]

[Anonymous], INT C POW SYST TECHN

[3]

[Anonymous], 2012, ADV NEURAL INFORM PR

[4]

[Anonymous], 2011, PROC 28 INT C MACH L

[5]

Bachrach Y., 2012, Proceedings of the 29th International Conference on Machine Learning (ICML-12), P1183

[6] Asking the Right Questions in Crowd Data Sourcing [J].

Boim, Rubi ;

Greenshpan, Ohad ;

Milo, Tova ;

Novgorodov, Slava ;

Polyzotis, Neoklis ;

Tan, Wang-Chiew .

2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, :1261-1264

[7]

Chen Xi, 2013, PMLR, P64

[8]

Dalvi N., 2013, P 22 INT C WORLD WID, P285

[9] Towards Globally Optimal Crowdsourcing Quality Management: The Uniform Worker Setting [J].

Das Sarma, Akash ;

Parameswaran, Aditya ;

Widom, Jennifer .

SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, :47-62

[10]

Dawid A. P., 1979, J ROYAL STAT SOC SER, V28, P20

← 1 2 3 4 →