Interactive Supervision with t-SNE

被引:1
|
作者
Luus, Francois [1 ]
Khan, Naweed [1 ]
Akhalwaya, Ismail [1 ]
机构
[1] IBM Res, Armonk, NY 10504 USA
来源
PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON KNOWLEDGE CAPTURE (K-CAP '19) | 2019年
关键词
semi-supervised learning; active learning; dimensionality reduction;
D O I
10.1145/3360901.3364414
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Knowledge capture from human experts in domain-specific settings can benefit from incisive use of machine intelligence to reduce expended time and effort. Such a capability can be of significant value to deep learning, given its demand for large labeled data. We propose an ML-based system for interactive labeling of image datasets to speed up class attribution performed by domain experts. The tool visualizes feature spaces and makes it directly editable through online integration of applied labels. We propose realistic annotation emulation to evaluate the system design of interactive active learning, based on our improved semi-supervised extension of t-SNE dimensionality reduction. We contribute globally normalized attractions, semi-supervised repulsion, smoothed label integration, and parameter optimization in our improved t-SNE. Our active learning tool can significantly increase labeling efficiency compared to uncertainty sampling, and we show that less than 100 labeling actions are typically sufficient for good classification on a variety of specialized image datasets. Our contribution is unique given that it needs to perform dimensionality reduction, feature space visualization and editing, interactive label propagation, low-complexity active learning, human perceptual modeling, annotation emulation and unsupervised feature extraction for specialized datasets in a production-quality implementation.
引用
收藏
页码:85 / 92
页数:8
相关论文
共 50 条
  • [41] t-SNE Visualization of Large-Scale Neural Recordings
    Dimitriadis, George
    Neto, Joana P.
    Kampff, Adam R.
    NEURAL COMPUTATION, 2018, 30 (07) : 1750 - 1774
  • [42] Deep learning and t-SNE projection for plankton images clusterization
    Homsi Goulart, Antonio Jose
    Morimitsu, Alexandre
    Jacomassi, Renan
    Hirata, Nina
    Lopes, Rubens
    OCEANS 2021: SAN DIEGO - PORTO, 2021,
  • [43] Revised Conditional t-SNE: Looking Beyond the Nearest Neighbors
    Heiter, Edith
    Kang, Bo
    Seurinck, Ruth
    Lijffijt, Jefrey
    ADVANCES IN INTELLIGENT DATA ANALYSIS XXI, IDA 2023, 2023, 13876 : 169 - 181
  • [44] Dimensionality Reduction via Dynamical Systems: The Case of t-SNE
    Linderman, George C.
    Steinerberger, Stefan
    SIAM REVIEW, 2022, 64 (01) : 153 - 178
  • [45] Confidence estimation for t-SNE embeddings using random forest
    Yigin, Busra Ozgode
    Saygili, Gorkem
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (12) : 3981 - 3992
  • [46] Accelerating t-SNE using tree-based algorithms
    Van Der Maaten, Laurens
    Journal of Machine Learning Research, 2015, 15 : 3221 - 3245
  • [47] A t-SNE Based Classification Approach to Compositional Microbiome Data
    Xu, Xueli
    Xie, Zhongming
    Yang, Zhenyu
    Li, Dongfang
    Xu, Ximing
    FRONTIERS IN GENETICS, 2020, 11
  • [48] Using Global t-SNE to Preserve Intercluster Data Structure
    Zhou, Yuansheng
    Sharpee, Tatyana O.
    NEURAL COMPUTATION, 2022, 34 (08) : 1637 - 1651
  • [49] The art of using t-SNE for single-cell transcriptomics
    Kobak, Dmitry
    Berens, Philipp
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [50] Confidence estimation for t-SNE embeddings using random forest
    Busra Ozgode Yigin
    Gorkem Saygili
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 3981 - 3992