Semi-Automated Data Labeling

Cited by: 0
Authors
Desmond, Michael [1]
Duesterwald, Evelyn [1]
Brimijoin, Kristina [1]
Brachman, Michelle [1]
Pan, Qian [1]
Affiliations
[1] IBM Thomas J Watson Res Ctr, 1101 Kitchawan Rd, Yorktown Hts, NY 10598 USA
Source
NEURIPS 2020 COMPETITION AND DEMONSTRATION TRACK, VOL 133 | 2020 / Vol. 133
Keywords
Data Labeling; Human Computer Interaction; Interactive Machine Learning
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Labeling data is often a tedious and error-prone activity. However, organizing the labeling experience as a human-machine collaboration has the potential to improve label quality and reduce human effort. In this paper we describe a semi-automated data labeling system which employs a predictive model to guide and assist the human labeler. The model learns by observing labeling decisions, and is used to recommend labels and automate basic functions in the labeling interface. Agreement between the labeler and the model is tracked and presented via a system of checkpoints. At each checkpoint the labeler has the opportunity to delegate the remainder of the labeling task to the model.
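The workflow in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say which predictive model, checkpoint interval, or delegation rule the system uses, so everything here is an assumption made for illustration. `TokenVoteModel` is a hypothetical toy classifier, `label_with_checkpoints`, `checkpoint_every`, and `delegate_at` are invented names, and a fixed agreement threshold stands in for the labeler's own decision to delegate.

```python
from collections import Counter, defaultdict
from typing import Callable, List, Optional, Sequence, Tuple


class TokenVoteModel:
    """Hypothetical toy model: each token votes for the labels it was seen with."""

    def __init__(self) -> None:
        self.token_label_counts = defaultdict(Counter)

    def update(self, text: str, label: str) -> None:
        # Learn from one observed labeling decision.
        for token in text.lower().split():
            self.token_label_counts[token][label] += 1

    def predict(self, text: str) -> Optional[str]:
        # Recommend the label with the most token votes, or None if no token is known.
        votes = Counter()
        for token in text.lower().split():
            votes.update(self.token_label_counts.get(token, {}))
        return votes.most_common(1)[0][0] if votes else None


def label_with_checkpoints(
    items: Sequence[str],
    ask_human: Callable[[str, Optional[str]], str],
    checkpoint_every: int = 4,   # assumed interval, not from the paper
    delegate_at: float = 0.75,   # assumed stand-in for the labeler's decision
) -> Tuple[List[Tuple[str, Optional[str], str]], List[float]]:
    model = TokenVoteModel()
    labels: List[Tuple[str, Optional[str], str]] = []
    checkpoints: List[float] = []
    agreed = 0
    for index, text in enumerate(items):
        recommended = model.predict(text)     # model recommends a label
        chosen = ask_human(text, recommended)  # human makes the decision
        labels.append((text, chosen, "human"))
        agreed += int(recommended == chosen)
        model.update(text, chosen)
        seen = index + 1
        if seen % checkpoint_every == 0:
            # Checkpoint: report agreement over the items since the last checkpoint.
            agreement = agreed / checkpoint_every
            checkpoints.append(agreement)
            agreed = 0
            if agreement >= delegate_at:
                # Delegate the remainder of the task to the model.
                for rest in items[seen:]:
                    labels.append((rest, model.predict(rest), "model"))
                break
    return labels, checkpoints


def oracle(text: str, recommended: Optional[str]) -> str:
    # Simulated labeler for the demo; ignores the recommendation.
    return "billing" if "refund" in text else "account"


items = [
    "refund my order", "refund please", "login fails", "cannot login",
    "refund the charge", "login error again", "refund now", "login broken",
    "refund request pending", "login page down",
]
labels, checkpoints = label_with_checkpoints(items, oracle)
```

In this demo the simulated labeler and the toy model agree on half of the first four items and on all of the next four, so the last two items are labeled by the model. In an interactive tool the threshold test would presumably be replaced by a prompt asking the labeler whether to delegate.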
Pages: 156 - 169
Page count: 14
Related papers
50 in total
  • [21] Hybrid data labeling algorithm for clustering large mixed type data
    Ravi Sankar Sangam
    Hari Om
    Journal of Intelligent Information Systems, 2015, 45 : 273 - 293
  • [22] A game-based framework for crowdsourced data labeling
    Yang, Jingru
    Fan, Ju
    Wei, Zhewei
    Li, Guoliang
    Liu, Tongyu
    Du, Xiaoyong
    VLDB JOURNAL, 2020, 29 (06) : 1311 - 1336
  • [23] A game-based framework for crowdsourced data labeling
    Jingru Yang
    Ju Fan
    Zhewei Wei
    Guoliang Li
    Tongyu Liu
    Xiaoyong Du
    The VLDB Journal, 2020, 29 : 1311 - 1336
  • [24] Classification framework and semantic labeling for Big Earth Data
    Wang, Juanle
    Bu, Kun
    Yan, Dongmei
    Wang, Jingyue
    Duan, Bowen
    Zhang, Min
    He, Guojin
    BIG EARTH DATA, 2023, 7 (03) : 886 - 903
  • [25] A Data Labeling method for Categorical Data Clustering using Cluster Entropies in Rough Sets
    Reddy, H. Venkateswara
    Kumar, B. Suresh
    Raju, S. Viswanadha
    2014 FOURTH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT), 2014, : 444 - 449
  • [26] Connecting Paper to Digitization-a Homework Data Processing System with Data Labeling and Visualization
    Li, Qingcheng
    Cao, Heng
    Lu, Ye
    PROCEEDINGS OF THE 14TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS 2017), 2017, : 504 - 510
  • [27] Selective genetic algorithm labeling: A new data labeling method for machine learning stock market trading systems
    Han, Yechan
    Kim, Jaeyun
    Enke, David
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 135
  • [28] Adaptive Model Scheduling for Resource-efficient Data Labeling
    Yuan, Mu
    Zhang, Lan
    Li, Xiang-Yang
    Yang, Lin-Zhuo
    Xiong, Hui
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (04)
  • [29] SMART: An Open Source Data Labeling Platform for Supervised Learning
    Chew, Rob
    Wenger, Michael
    Kery, Caroline
    Nance, Jason
    Richards, Keith
    Hadley, Emily
    Baumgartner, Peter
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20 : 1 - 5
  • [30] Scalabeling: Linear Slider Supported Labeling for the Classification of Streaming Data
    Steinmeier, Christine
    Budke, Jan
    Becking, Dominic
    IEEE EUROCON 2021 - 19TH INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES, 2021, : 233 - 238