Performance Boosting Mislabels Correction with Semi-Supervised Learning and Deep Feature Similarity Measurements

Cited by: 0

Authors:

Sun, Chi-Chia [1]

Guo, Jing-Ming [2]

Lin, Jheng-Han [2]

Chang, Ting-Yu [2]

Affiliations:

[1] Natl Formosa Univ, Smart Machine & Intelligent Mfg Res Ctr, Dept Elect Engn, EE 64,Wunhua Rd, Huiwei 632, Taiwan

[2] Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei 10607, Taiwan

Source:

JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY | 2023 / Vol. 67 / No. 06

Keywords:

Compendex;

DOI:

10.2352/J.ImagingSci.Technol.2023.67.6.060501

Chinese Library Classification (CLC) number:

TB8 [Photographic technology];

Discipline classification code:

0804;

Abstract:

This paper approaches training on a noisy dataset from the perspective of semi-supervised learning, combining a small amount of cleanly annotated data with a large amount of misannotated data. The purpose of the study is to tackle the problem of noisy datasets in order to boost model performance. Clothing1M was used in the experiments. The clean dataset is treated as the labeled dataset, and the remaining noisy data are regarded as unlabeled data. An initial model is first trained on the labeled dataset and then used to extract features from the unlabeled dataset. "Prototypes" for each category are obtained via feature matching and clustering. A dual screening scheme is then proposed that takes both the model's predictions and the predictions of the prototype method into account, reducing the impact of noisy data. The clean dataset obtained after screening and the remaining data with noisy labels are trained with MixMatch to further enhance the robustness of the models. Experimental results show that the proposed method boosts classification accuracy by 3% and outperforms the state-of-the-art method by 1%. It achieves (1) cost reduction in labeling, (2) mitigation of the impact of noisy data via the dual screening scheme, and (3) a performance boost from semi-supervised learning. © 2023 Society for Imaging Science and Technology.
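The abstract does not spell out the screening rule, so the following is only a minimal sketch of how a dual screening step of this kind might look, not the authors' implementation. It assumes that each class prototype is the mean of the L2-normalized clean features (a stand-in for the feature matching and clustering step), that the prototype prediction is the nearest prototype by cosine similarity, and that a noisy sample is kept only when both predictions agree with its given label. The names `class_prototypes` and `dual_screen` and the toy arrays are hypothetical.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / np.maximum(np.linalg.norm(x, axis=1, keepdims=True), eps)


def class_prototypes(features, labels, num_classes):
    """Return one unit-length prototype per class.

    Assumption: the mean of the normalized clean features of a class
    stands in for the paper's feature matching and clustering step.
    """
    feats = l2_normalize(features)
    protos = np.stack([feats[labels == c].mean(axis=0) for c in range(num_classes)])
    return l2_normalize(protos)


def dual_screen(noisy_features, noisy_labels, model_probs, prototypes):
    """Return a boolean mask of noisy samples whose given label is trusted.

    Assumed rule: the classifier's argmax and the nearest prototype
    must both agree with the given label.
    """
    model_pred = model_probs.argmax(axis=1)
    sims = l2_normalize(noisy_features) @ prototypes.T  # cosine similarities
    proto_pred = sims.argmax(axis=1)
    return (model_pred == noisy_labels) & (proto_pred == noisy_labels)


# Toy data: 2-D features from a hypothetical feature extractor, two classes.
clean_x = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
clean_y = np.array([0, 0, 1, 1])
protos = class_prototypes(clean_x, clean_y, num_classes=2)

noisy_x = np.array([[1.0, 0.1], [0.1, 1.0], [0.9, 0.0]])
noisy_y = np.array([0, 0, 0])  # given (possibly wrong) labels
probs = np.array([[0.9, 0.1], [0.2, 0.8], [0.4, 0.6]])  # classifier outputs

mask = dual_screen(noisy_x, noisy_y, probs, protos)
print(mask.tolist())  # [True, False, False]
```

In this toy run only the first sample passes both checks. The second fails both, and the third is rejected because the classifier disagrees with the prototype prediction. Under the setup described in the abstract, samples that pass would join the clean set and the rest would remain as noisy data for the MixMatch stage.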


Pages: 8
