Semi-Automated Data Labeling

被引:0
|
作者
Desmond, Michael [1 ]
Duesterwald, Evelyn [1 ]
Brimijoin, Kristina [1 ]
Brachman, Michelle [1 ]
Pan, Qian [1 ]
机构
[1] IBM Thomas J Watson Res Ctr, 1101 Kitchawan Rd, Yorktown Hts, NY 10598 USA
来源
NEURIPS 2020 COMPETITION AND DEMONSTRATION TRACK, VOL 133 | 2020年 / 133卷
关键词
Data Labeling; Human Computer Interaction; Interactive Machine Learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Labeling data is often a tedious and error-prone activity. However, organizing the labeling experience as a human-machine collaboration has the potential to improve label quality and reduce human effort. In this paper we describe a semi-automated data labeling system which employs a predictive model to guide and assist the human labeler. The model learns by observing labeling decisions, and is used to recommend labels and automate basic functions in the labeling interface. Agreement between the labeler and the model is tracked and presented via a system of checkpoints. At each checkpoint the labeler has the opportunity to delegate the remainder of the labeling task to the model.
引用
收藏
页码:156 / 169
页数:14
相关论文
共 50 条
  • [1] Semi-Automated Data Labeling for Activity Recognition in Pervasive Healthcare
    Cruz-Sandoval, Dagoberto
    Beltran-Marquez, Jessica
    Garcia-Constantino, Matias
    Gonzalez-Jasso, Luis A.
    Favela, Jesus
    Hussein Lopez-Nava, Irvin
    Cleland, Ian
    Ennis, Andrew
    Hernandez-Cruz, Netzahualcoyotl
    Rafferty, Joseph
    Synnott, Jonathan
    Nugent, Chris
    SENSORS, 2019, 19 (14)
  • [2] Semi-automated content zoning of spam emails
    Brucks, Claudine
    Hilker, Michael
    Schommer, Christoph
    Wagner, Cynthia
    Weires, Ralph
    WEB INFORMATION SYSTEMS AND TECHNOLOGIES, 2008, 8 : 35 - 44
  • [3] Comparing Semi-Automated Clustering Methods for Persona Development
    Brickey, Jonalan
    Walczak, Steven
    Burgess, Tony
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2012, 38 (03) : 537 - 546
  • [4] Rule-Enhanced Active Learning for Semi-Automated Weak Supervision
    Kartchner, David
    An, Davi Nakajima
    Ren, Wendi
    Zhang, Chao
    Mitchell, Cassie S.
    AI, 2022, 3 (01) : 211 - 228
  • [5] A model based approach to semi-automated user interface generation for process control interactive applications
    Moussa, F
    Kolski, C
    Riahi, M
    INTERACTING WITH COMPUTERS, 2000, 12 (03) : 245 - 279
  • [6] Multimodal semi-automated affect detection from conversational cues, gross body language, and facial features
    Sidney K. D’Mello
    Arthur Graesser
    User Modeling and User-Adapted Interaction, 2010, 20 : 147 - 187
  • [7] Multimodal semi-automated affect detection from conversational cues, gross body language, and facial features
    D'Mello, Sidney K.
    Graesser, Arthur
    USER MODELING AND USER-ADAPTED INTERACTION, 2010, 20 (02) : 147 - 187
  • [8] Human-machine collaboration on data annotation of images by semi-automatic labeling
    Haider, Tom
    Michahelles, Florian
    MENSCH AND COMPUTER 2021 (MUC 21), 2021, : 552 - 556
  • [9] On data labeling for clustering categorical data
    Chen, Hung-Leng
    Chuang, Kun-Ta
    Chen, Ming-Syan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (11) : 1458 - 1471
  • [10] OneLabeler: A Flexible System for Building Data Labeling Tools
    Zhang, Yu
    Wang, Yun
    Zhang, Haidong
    Zhu, Bin
    Chen, Siming
    Zhang, Dongmei
    PROCEEDINGS OF THE 2022 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI' 22), 2022,