Semi-Automated Data Labeling

Cited by: 0
Authors
Desmond, Michael [1]
Duesterwald, Evelyn [1]
Brimijoin, Kristina [1]
Brachman, Michelle [1]
Pan, Qian [1]
Affiliation
[1] IBM Thomas J Watson Res Ctr, 1101 Kitchawan Rd, Yorktown Hts, NY 10598 USA
Source
NEURIPS 2020 COMPETITION AND DEMONSTRATION TRACK, VOL 133 | 2020 / Vol. 133
Keywords
Data Labeling; Human Computer Interaction; Interactive Machine Learning
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Labeling data is often a tedious and error-prone activity. However, organizing the labeling experience as a human-machine collaboration has the potential to improve label quality and reduce human effort. In this paper we describe a semi-automated data labeling system which employs a predictive model to guide and assist the human labeler. The model learns by observing labeling decisions, and is used to recommend labels and automate basic functions in the labeling interface. Agreement between the labeler and the model is tracked and presented via a system of checkpoints. At each checkpoint the labeler has the opportunity to delegate the remainder of the labeling task to the model.
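The checkpoint mechanism described in the abstract can be illustrated with a minimal sketch. This is not the authors' system: the toy `KeywordModel`, the function `label_with_checkpoints`, the `checkpoint_every` and `threshold` parameters, and the automatic delegation rule are all assumptions made for illustration. In the paper the labeler decides whether to delegate at each checkpoint; here delegation is triggered automatically once agreement within a checkpoint window reaches the threshold.

```python
from collections import Counter, defaultdict


class KeywordModel:
    """Toy online classifier (hypothetical): votes with per-word label counts."""

    def __init__(self):
        self.word_label_counts = defaultdict(Counter)

    def predict(self, text):
        votes = Counter()
        for word in text.lower().split():
            votes.update(self.word_label_counts[word])
        # With no evidence yet, the model makes no recommendation.
        return votes.most_common(1)[0][0] if votes else None

    def learn(self, text, label):
        for word in text.lower().split():
            self.word_label_counts[word][label] += 1


def label_with_checkpoints(items, ask_human, checkpoint_every=4, threshold=0.75):
    """Label items with a human in the loop; hand over to the model after a
    checkpoint at which model/human agreement reaches `threshold`."""
    model = KeywordModel()
    labels = []
    delegated_from = None  # index at which the model took over, if ever
    agreements = 0
    for i, text in enumerate(items):
        suggestion = model.predict(text)
        if delegated_from is not None:
            # Delegated: the model's label is accepted without human review.
            labels.append(suggestion)
            continue
        human_label = ask_human(text, suggestion)  # human sees the recommendation
        labels.append(human_label)
        agreements += (suggestion == human_label)
        model.learn(text, human_label)  # model observes the labeling decision
        if (i + 1) % checkpoint_every == 0:
            rate = agreements / checkpoint_every
            agreements = 0
            if rate >= threshold:
                delegated_from = i + 1
    return labels, delegated_from
```

A call such as `label_with_checkpoints(texts, ask_human)` returns the full label list together with the index at which the model took over (or `None` if it never did). Note that a delegated label can be `None` when the toy model has seen none of the words in an item; a real system would need a fallback for that case.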
Pages: 156-169
Page count: 14