Semi-Automated Data Labeling

Cited by: 0
Authors
Desmond, Michael [1]
Duesterwald, Evelyn [1]
Brimijoin, Kristina [1]
Brachman, Michelle [1]
Pan, Qian [1]
Affiliations
[1] IBM Thomas J Watson Res Ctr, 1101 Kitchawan Rd, Yorktown Hts, NY 10598 USA
Keywords
Data Labeling; Human Computer Interaction; Interactive Machine Learning
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Labeling data is often a tedious and error-prone activity. However, organizing the labeling experience as a human-machine collaboration has the potential to improve label quality and reduce human effort. In this paper we describe a semi-automated data labeling system which employs a predictive model to guide and assist the human labeler. The model learns by observing labeling decisions, and is used to recommend labels and automate basic functions in the labeling interface. Agreement between the labeler and the model is tracked and presented via a system of checkpoints. At each checkpoint the labeler has the opportunity to delegate the remainder of the labeling task to the model.
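The workflow described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the model, the checkpoint interval, or how the delegation decision is made. The per-word voting model `KeywordModel`, the function `label_with_checkpoints`, and the parameters `checkpoint_every` and `delegate_at` are all hypothetical, and the fixed agreement threshold stands in for the labeler's own choice to delegate at a checkpoint.

```python
from collections import Counter, defaultdict


class KeywordModel:
    """Toy stand-in for the predictive model (an assumption, not the paper's model)."""

    def __init__(self):
        # word -> Counter of labels observed alongside that word
        self.counts = defaultdict(Counter)

    def update(self, text, label):
        """Learn from one observed labeling decision."""
        for word in text.lower().split():
            self.counts[word][label] += 1

    def predict(self, text):
        """Recommend the label with the most word votes, or None if no word is known."""
        votes = Counter()
        for word in text.lower().split():
            votes.update(self.counts.get(word, Counter()))
        return votes.most_common(1)[0][0] if votes else None


def label_with_checkpoints(items, ask_human, checkpoint_every=4, delegate_at=0.75):
    """Label items with model suggestions; return (labels, number labeled by the human).

    Every `checkpoint_every` items, agreement between the human and the model is
    computed. If it reaches `delegate_at`, the remaining items are delegated to
    the model. Both parameters are hypothetical.
    """
    model = KeywordModel()
    labels = {}
    agree = seen = 0
    for i, text in enumerate(items):
        suggestion = model.predict(text)      # recommended label, may be None
        label = ask_human(text, suggestion)   # the human makes the final decision
        labels[text] = label
        model.update(text, label)             # the model observes the decision
        seen += 1
        agree += int(suggestion == label)
        if seen == checkpoint_every:          # checkpoint reached
            if agree / seen >= delegate_at:
                for rest in items[i + 1:]:    # delegate the remainder to the model
                    labels[rest] = model.predict(rest)
                return labels, i + 1
            agree = seen = 0                  # start a new checkpoint window
    return labels, len(items)


def simulated_labeler(text, suggestion):
    """Stand-in for a human labeler; ignores the suggestion and applies a fixed rule."""
    return "pos" if "good" in text else "neg"


items = [
    "good movie", "bad movie", "good plot", "bad plot",
    "good acting", "bad acting", "good score", "bad score",
    "good ending", "bad ending",
]
labels, human_count = label_with_checkpoints(items, simulated_labeler)
print(human_count, "of", len(items), "items labeled by the human")
```

In this toy run the model disagrees with the labeler on early items, so the first checkpoint does not trigger delegation; by the second checkpoint agreement is high enough that the last items are labeled by the model alone.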
Pages: 156-169
Page count: 14