Performance Boosting Mislabels Correction with Semi-Supervised Learning and Deep Feature Similarity Measurements

被引:0
|
作者
Sun, Chi-Chia [1 ]
Guo, Jing-Ming [2 ]
Lin, Jheng-Han [2 ]
Chang, Ting-Yu [2 ]
机构
[1] Natl Formosa Univ, Smart Machine & Intelligent Mfg Res Ctr, Dept Elect Engn, EE 64,Wunhua Rd, Huiwei 632, Taiwan
[2] Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei 10607, Taiwan
关键词
Compendex;
D O I
10.2352/J.ImagingSci.Technol.2023.67.6.060501
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
In this paper, dataset is applied from the perspective of semi-supervised learning, using a small amount of clean annotated data and combining a large amount of misannotated data for training. Clothing1M was used in the experiments. Therefore, the purpose of this study is to tackle the problem of noisy datasets to boost the models' performance. From the perspective of semi-supervised learning, the clean dataset is treated as the labeled dataset, and the remaining noisy data are regarded as the unlabeled data. The initial model was trained on the labeled dataset first, and then the model was used to perform feature extraction on the unlabeled dataset. The "prototypes" for each category can be obtained via feature matching and clustering. As a result, the dual screening scheme is proposed to take the model's predictions and the predictions from the prototypes method into account, reducing the impact of noisy data. The clean dataset after screening and the remaining data with noisy labels were trained by MixMatch to further enhance the robustness of models. Experimental results show that the proposed methods can boost the classification performance by 3% in accuracy, and outperform the state-of-the-art method by 1%. It achieves (1) cost reduction in labeling, (2) impact mitigation of noisy data via the dual screening scheme, and (3) performance boosting by semi-supervised learning. -c 2023 Society for Imaging Science and Technology.
引用
收藏
页数:8
相关论文
共 13 条
  • [11] Semi-supervised learning approach for construction object detection by integrating super-resolution and mean teacher network
    Zhang, Wen-Jie
    Wan, Hua-Ping
    Hu, Peng-Hua
    Ge, Hui-Bin
    Luo, Yaozhi
    Todd, Michael D.
    Journal of Infrastructure Intelligence and Resilience, 2024, 3 (04):
  • [12] Semi-supervised Fault Diagnosis Method via Graph Label Propagation and Discriminative Feature Enhancement for Critical Components of Industrial Robot
    Han, Te
    Li, Yanfu
    Lei, Yaguo
    Li, Naipeng
    Li, Xiang
    Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2022, 58 (17): : 116 - 124
  • [13] Combining BERT Model with Semi-Supervised Incremental Learning for Heterogeneous Knowledge Fusion of High-Speed Railway On-Board System
    Zhou, Lu-Jie
    Zhao, Zhi-Peng
    Dang, Jian-Wu
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022