CrowdRL: An End-to-End Reinforcement Learning Framework for Data Labelling

Cited by: 11
Authors
Li, Kaiyu [1]
Li, Guoliang [1]
Wang, Yong [1]
Huang, Yan [2]
Liu, Zitao [2]
Wu, Zhongqin [2]
Affiliations
[1] Tsinghua University, Beijing, People's Republic of China
[2] TAL Education, Beijing, People's Republic of China
Source
2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021) | 2021
Funding
National Key Research and Development Program of China
Keywords
reinforcement learning; crowdsourcing; data labelling; truth inference; ALGORITHMS; GAME;
DOI
10.1109/ICDE51399.2021.00032
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Data labelling is essential in many database and machine learning applications. Traditional methods rely on humans (workers or experts) to acquire labels, but the human cost is high for a large dataset. Active-learning-based methods label only a small set of data with high uncertainty, train a model on the labelled data, and use the trained model to label the remaining unlabelled data. However, they have two limitations. First, they cannot judiciously select appropriate data (task selection) and assign the tasks to suitable humans (task assignment); moreover, they process task selection and task assignment independently and so cannot capture the correlation between them. Second, they infer the truth of a task from the answers of humans and the trained model (truth inference) by modelling humans and models independently. In other words, they ignore the correlation between the two (the labelled data may contain noise caused by biased humans, and a model trained on the noisy labels may introduce additional biases), which leads to poor inference results. To address these limitations, we propose CrowdRL, an end-to-end reinforcement learning (RL) based framework for data labelling. To the best of our knowledge, CrowdRL is the first RL framework designed for the data labelling workflow that seamlessly integrates task selection, task assignment and truth inference. CrowdRL combines heterogeneous annotators (experts and crowdsourcing workers) with machine learning models to infer the truth, which substantially improves the quality of data labelling. CrowdRL uses RL to model task selection and task assignment, and designs an agent to judiciously assign tasks to appropriate workers. It jointly models the answers of workers, experts and models, and designs a joint inference model to infer the truths. Experimental results on real datasets show that CrowdRL outperforms state-of-the-art approaches, achieving 5%-20% higher accuracy at the same or even lower monetary cost.
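The three workflow steps named in the abstract (task selection, task assignment, truth inference) can be made concrete with a small sketch. The code below is not CrowdRL: it illustrates the simpler baseline style the abstract contrasts itself with, where uncertain tasks are picked by model entropy and the truth is inferred by a weighted vote over independent annotators. All names (`select_most_uncertain`, `weighted_vote`, the annotator IDs and the weights) are hypothetical and chosen only for illustration.

```python
import math
from collections import defaultdict


def entropy(probs):
    """Shannon entropy (natural log) of a class-probability list."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def select_most_uncertain(model_probs, k):
    """Task selection baseline: pick the k tasks whose model
    predictions have the highest entropy.

    model_probs maps a task id to a list of class probabilities.
    """
    ranked = sorted(model_probs, key=lambda t: entropy(model_probs[t]), reverse=True)
    return ranked[:k]


def weighted_vote(answers, weights):
    """Truth inference baseline: weighted majority vote.

    answers maps an annotator id (worker, expert or model) to a label;
    weights maps an annotator id to an assumed reliability score.
    Each annotator is treated independently, which is exactly the
    simplification the abstract criticises.
    Returns (winning label, share of the total weight it received).
    """
    scores = defaultdict(float)
    for annotator, label in answers.items():
        scores[label] += weights.get(annotator, 1.0)
    total = sum(scores.values())
    # Iterate labels in sorted order so ties resolve deterministically.
    best = max(sorted(scores), key=lambda label: scores[label])
    return best, scores[best] / total


if __name__ == "__main__":
    # Hypothetical model outputs for three unlabelled tasks.
    model_probs = {"t1": [0.5, 0.5], "t2": [0.9, 0.1], "t3": [0.6, 0.4]}
    print(select_most_uncertain(model_probs, 2))

    # Hypothetical answers and reliability weights for one task.
    answers = {"worker_a": "cat", "worker_b": "dog", "expert": "dog", "model": "cat"}
    weights = {"worker_a": 0.6, "worker_b": 0.7, "expert": 0.95, "model": 0.8}
    print(weighted_vote(answers, weights))
```

In this sketch the selection step and the inference step never exchange information, and the model's vote is weighted as if its errors were unrelated to the workers' errors. According to the abstract, CrowdRL replaces both simplifications with an RL agent and a joint inference model; the details of those components are not given here.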
Pages: 289-300
Number of pages: 12