CrowdRL: An End-to-End Reinforcement Learning Framework for Data Labelling

被引:13
作者
Li, Kaiyu [1 ]
Li, Guoliang [1 ]
Wang, Yong [1 ]
Huang, Yan [2 ]
Liu, Zitao [2 ]
Wu, Zhongqin [2 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] TAL Educ, Beijing, Peoples R China
来源
2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021) | 2021年
基金
国家重点研发计划;
关键词
reinforcement learning; crowdsourcing; data labelling; truth inference; ALGORITHMS; GAME;
D O I
10.1109/ICDE51399.2021.00032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data labelling is very important in many database and machine learning applications. Traditional methods rely on humans (workers or experts) to acquire labels. However, the human cost is rather expensive for a large dataset. Active learning based methods only label a small set of data with large uncertainty, train a model on these labelled data, and use the trained model to label the remainder unlabelled data. However they have two limitations. First, they cannot judiciously select appropriate data (task selection) and assign the tasks to proper humans (task assignment). Moreover, they independently process task selection and task assignment, which cannot capture the correlation between them. Second, they simply infer the truth of a task based on the answers from humans and the trained model (truth inference) by independently modeling humans and models. In other words, they ignore the correlation between them (the labelled data may have noise caused by humans with biases, and the model trained by the noisy labels may bring additional biases), and thus lead to poor inference results. To address these limitations, in this paper, we propose CrowdRL, an end-to-end reinforcement learning (RL) based framework for data labelling. To the best of our knowledge, CrowdRL is the first RL framework designed for the data labelling workflow by seamlessly integrating task selection, task assignment and truth inference together. CrowdRL fully utilizes the power of heterogeneous annotators (experts and crowdsourcing workers) and machine learning models together to infer the truth, which highly improves the quality of data labelling. CrowdRL uses RL to model task assignment and task selection, and designs an agent to judiciously assign tasks to appropriate workers. CrowdRL jointly models the answers of workers, experts and models, and designs a joint inference model to infer the truths. Experimental results on real datasets show that CrowdRL outperforms state-of-the-art approaches with the same (even fewer) monetary cost while achieving 5%-20% higher accuracy.
引用
收藏
页码:289 / 300
页数:12
相关论文
共 49 条
[1]   MIN-MAX HEAPS AND GENERALIZED PRIORITY-QUEUES [J].
ATKINSON, MD ;
SACK, JR ;
SANTORO, N ;
STROTHOTTE, T .
COMMUNICATIONS OF THE ACM, 1986, 29 (10) :996-1000
[2]   Finite-time analysis of the multiarmed bandit problem [J].
Auer, P ;
Cesa-Bianchi, N ;
Fischer, P .
MACHINE LEARNING, 2002, 47 (2-3) :235-256
[3]  
Aydin BI, 2014, AAAI CONF ARTIF INTE, P2946
[4]  
Beygelzimer Alina, 2009, P 26 ANN INT C MACH, P49, DOI DOI 10.1145/1553374.1553381
[5]  
Bringer E., 2019, SIGMOD
[6]   Crowdsourcing Database Systems: Overview and Challenges [J].
Chai, Chengliang ;
Fan, Ju ;
Li, Guoliang ;
Wang, Jiannan ;
Zheng, Yudian .
2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, :2052-2055
[7]   A partial-order-based framework for cost-effective crowdsourced entity resolution [J].
Chai, Chengliang ;
Li, Guoliang ;
Li, Jian ;
Deng, Dong ;
Feng, Jianhua .
VLDB JOURNAL, 2018, 27 (06) :745-770
[8]   Active learning with statistical models [J].
Cohn, DA ;
Ghahramani, Z ;
Jordan, MI .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :129-145
[9]   GOGGLES: Automatic Image Labeling with Affinity Coding [J].
Das, Nilaksh ;
Chaba, Sanya ;
Wu, Renzhi ;
Gandhi, Sakshi ;
Chau, Duen Horng ;
Chu, Xu .
SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, :1717-1732
[10]   Deep Learning for User Interest and Response Prediction in Online Display Advertising [J].
Gharibshah, Zhabiz ;
Zhu, Xingquan ;
Hainline, Arthur ;
Conway, Michael .
DATA SCIENCE AND ENGINEERING, 2020, 5 (01) :12-26