Performance Boosting Mislabels Correction with Semi-Supervised Learning and Deep Feature Similarity Measurements

被引:0
|
作者
Sun, Chi-Chia [1 ]
Guo, Jing-Ming [2 ]
Lin, Jheng-Han [2 ]
Chang, Ting-Yu [2 ]
机构
[1] Natl Formosa Univ, Smart Machine & Intelligent Mfg Res Ctr, Dept Elect Engn, EE 64,Wunhua Rd, Huiwei 632, Taiwan
[2] Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei 10607, Taiwan
关键词
Compendex;
D O I
10.2352/J.ImagingSci.Technol.2023.67.6.060501
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
In this paper, dataset is applied from the perspective of semi-supervised learning, using a small amount of clean annotated data and combining a large amount of misannotated data for training. Clothing1M was used in the experiments. Therefore, the purpose of this study is to tackle the problem of noisy datasets to boost the models' performance. From the perspective of semi-supervised learning, the clean dataset is treated as the labeled dataset, and the remaining noisy data are regarded as the unlabeled data. The initial model was trained on the labeled dataset first, and then the model was used to perform feature extraction on the unlabeled dataset. The "prototypes" for each category can be obtained via feature matching and clustering. As a result, the dual screening scheme is proposed to take the model's predictions and the predictions from the prototypes method into account, reducing the impact of noisy data. The clean dataset after screening and the remaining data with noisy labels were trained by MixMatch to further enhance the robustness of models. Experimental results show that the proposed methods can boost the classification performance by 3% in accuracy, and outperform the state-of-the-art method by 1%. It achieves (1) cost reduction in labeling, (2) impact mitigation of noisy data via the dual screening scheme, and (3) performance boosting by semi-supervised learning. -c 2023 Society for Imaging Science and Technology.
引用
收藏
页数:8
相关论文
共 13 条
  • [1] Context Embedding Similarity based Semi-Supervised Active Learning for Time Series
    Zhou, Xianwei
    Lin, Yifan
    Yu, Songsen
    Wu, Shiqi
    Zhang, Wencong
    Zhang, Chulue
    Proceedings of the International Joint Conference on Neural Networks, 2024,
  • [2] Automated defect detection on inductive thermography images using supervised and semi-supervised Deep Learning methods
    Tout, Karim
    Samet, Naïm
    Bouteille, Patrick
    e-Journal of Nondestructive Testing, 2023, 28 (09):
  • [3] Semi-supervised learning for text-line detection
    Amazon.com, 701 Fifth Avenue, Seattle, WA 98104, United States
    Pattern Recogn. Lett., 11 (1260-1273):
  • [4] A semi-supervised human action recognition algorithm based on skeleton feature
    Yuan, Hejin
    Journal of Information Hiding and Multimedia Signal Processing, 2015, 6 (01): : 175 - 182
  • [5] Mutual consistency learning for semi-supervised medical image segmentation
    Wu, Yicheng
    Ge, Zongyuan
    Zhang, Donghao
    Xu, Minfeng
    Zhang, Lei
    Xia, Yong
    Cai, Jianfei
    Medical Image Analysis, 2022, 81
  • [6] ROAM: Random layer mixup for semi-supervised learning in medical images
    Bdair, Tariq
    Wiestler, Benedikt
    Navab, Nassir
    Albarqouni, Shadi
    IET Image Processing, 2023, 16 (10) : 2593 - 2608
  • [7] Supervised and semi-supervised sequence learning for recognition of requisite part and effectuation part in law sentences
    Nguyen, Le-Minh
    Bach, Ngo Xuan
    Shimazu, Akira
    FSMNLP 2011 - Proceedings of the 9th International Workshop Finite State Methods and Natural Language Processing, 2011, : 21 - 29
  • [8] IA-SSLM: Irregularity-Aware Semi-Supervised Deep Learning Model for Analyzing Unusual Events in Crowds
    Aljaloud, Abdulaziz Salamah
    Ullah, Habib
    IEEE Access, 2021, 9 : 73327 - 73334
  • [9] Performance of PCA based semi-supervised learning in face recognition using MPEG-7 edge histogram descriptor
    Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1000, Bangladesh
    J. Multimedia, 5 (404-415):
  • [10] Graph construction based on re-weighted sparse representation for semi-supervised learning
    Liu, X. (609370222@qq.com), 1600, Binary Information Press, Flat F 8th Floor, Block 3, Tanner Garden, 18 Tanner Road, Hong Kong (10):