CrowdRL: An End-to-End Reinforcement Learning Framework for Data Labelling

被引：11

作者：

Li, Kaiyu ^{[1
]}

Li, Guoliang ^{[1
]}

Wang, Yong ^{[1
]}

Huang, Yan ^{[2
]}

Liu, Zitao ^{[2
]}

Wu, Zhongqin ^{[2
]}

机构：

[1] Tsinghua Univ, Beijing, Peoples R China

[2] TAL Educ, Beijing, Peoples R China

来源：

2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021) | 2021年

基金：

国家重点研发计划;

关键词：

reinforcement learning; crowdsourcing; data labelling; truth inference; ALGORITHMS; GAME;

D O I：

10.1109/ICDE51399.2021.00032

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Data labelling is very important in many database and machine learning applications. Traditional methods rely on humans (workers or experts) to acquire labels. However, the human cost is rather expensive for a large dataset. Active learning based methods only label a small set of data with large uncertainty, train a model on these labelled data, and use the trained model to label the remainder unlabelled data. However they have two limitations. First, they cannot judiciously select appropriate data (task selection) and assign the tasks to proper humans (task assignment). Moreover, they independently process task selection and task assignment, which cannot capture the correlation between them. Second, they simply infer the truth of a task based on the answers from humans and the trained model (truth inference) by independently modeling humans and models. In other words, they ignore the correlation between them (the labelled data may have noise caused by humans with biases, and the model trained by the noisy labels may bring additional biases), and thus lead to poor inference results. To address these limitations, in this paper, we propose CrowdRL, an end-to-end reinforcement learning (RL) based framework for data labelling. To the best of our knowledge, CrowdRL is the first RL framework designed for the data labelling workflow by seamlessly integrating task selection, task assignment and truth inference together. CrowdRL fully utilizes the power of heterogeneous annotators (experts and crowdsourcing workers) and machine learning models together to infer the truth, which highly improves the quality of data labelling. CrowdRL uses RL to model task assignment and task selection, and designs an agent to judiciously assign tasks to appropriate workers. CrowdRL jointly models the answers of workers, experts and models, and designs a joint inference model to infer the truths. Experimental results on real datasets show that CrowdRL outperforms state-of-the-art approaches with the same (even fewer) monetary cost while achieving 5%-20% higher accuracy.

引用

页码：289 / 300

页数：12

共 50 条

[1] Deep reinforcement learning framework for end-to-end semiconductor process control
Hirtz T.
Tian H.
Shahzad S.
Wu F.
Yang Y.
Ren T.-L.
Neural Computing and Applications, 2024, 36 (20) : 12443 - 12460
[2] An End-to-End Deep Reinforcement Learning Framework for Electric Vehicle Routing Problem
Wang, Mengqin
Wei, Yanling
Huang, Xueliang
Gao, Shan
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (20): : 33671 - 33682
[3] A framework for end-to-end learning on semantic tree-structured data
Woof, William
Chen, Ke
arXiv, 2020,
[4] A Vision-Based End-to-End Reinforcement Learning Framework for Drone Target Tracking
Zhao, Xun
Huang, Xinjian
Cheng, Jianheng
Xia, Zhendong
Tu, Zhiheng
DRONES, 2024, 8 (11)
[5] End-to-End Video Captioning with Multitask Reinforcement Learning
Li, Lijun
Gong, Boqing
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 339 - 348
[6] NeuroVectorizer: End-to-End Vectorization with Deep Reinforcement Learning
Haj-Ali, Ameer
Ahmed, Nesreen K.
Willke, Ted
Shao, Yakun Sophia
Asanovic, Krste
Stoica, Ion
CGO'20: PROCEEDINGS OF THE18TH ACM/IEEE INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2020, : 242 - 255
[7] End-to-End Deep Reinforcement Learning for Exoskeleton Control
Rose, Lowell
Bazzocchi, Michael C. F.
Nejat, Goldie
2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 4294 - 4301
[8] An End-to-End Learning Framework for Video Compression
Lu, Guo
Zhang, Xiaoyun
Ouyang, Wanli
Chen, Li
Gao, Zhiyong
Xu, Dong
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3292 - 3308
[9] End-to-End Reinforcement Learning for Automatic Taxonomy Induction
Mao, Yuning
Ren, Xiang
Shen, Jiaming
Gu, Xiaotao
Han, Jiawei
PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 2462 - 2472
[10] ORACLE: End-to-End Model Based Reinforcement Learning
Andersen, Per-Arne
Goodwin, Morten
Granmo, Ole-Christoffer
ARTIFICIAL INTELLIGENCE XXXVIII, 2021, 13101 : 44 - 57

← 1 2 3 4 5 →