Robust Semi-Supervised Learning With Multi-Consistency and Data Augmentation

被引:2
作者
Guo, Jing-Ming [1 ]
Sun, Chi-Chia [2 ]
Chan, Kuan-Yu [1 ]
Liu, Chun-Yu [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei 106, Taiwan
[2] Natl Formosa Univ, SIMI Ctr, Dept Elect Engn, Huwei 632, Taiwan
关键词
Semisupervised learning; Robustness; Task analysis; Semi-supervised learning; Animal-10N; Clothing1M; multi-consistency; augmentation; noisy labels learning;
D O I
10.1109/TCE.2023.3331700
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we address the problem of noisy datasets by proposing a dual screening scheme to improve the performance of models trained on two public noisy datasets: Clothing1M and Animal-10N. As Web crawlers generate both datasets, their label error levels cannot be estimated. We use a warm-up model to separate the data into labeled and unlabeled data, which are then classified by multi-model consistency. We select consistent data from the dataset and provide pseudo-labels for training, while the remaining data is not trained as noisy data. This approach reduces the impact of noisy data and mislabeling. To improve the model's robustness, we combine clean data and unlabeled data with strong data augmentation and train them using the Mixup algorithm. Experimental results show that our proposed methods boost classification performance: the accuracy of Clothing1M is 0.1% higher than the state-of-the-art method, and the accuracy of Animal-10N is 2% higher than the state-of-the-art method. The significant contributions of this paper are: 1) adding strong data augmentation to enhance the model, 2) using multi-consistency to reduce the impact of noisy data, and 3) boosting performance through semi-supervised learning.
引用
收藏
页码:414 / 424
页数:11
相关论文
共 32 条
[1]  
Berthelot D, 2019, Arxiv, DOI [arXiv:1905.02249, DOI 10.48550/ARXIV.1905.02249]
[2]  
Berthelot D, 2019, ADV NEUR IN, V32
[3]  
Chen W., 2021, arXiv
[4]   Exploring Simple Siamese Representation Learning [J].
Chen, Xinlei ;
He, Kaiming .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15745-15753
[5]  
Chen YY, 2022, Arxiv, DOI arXiv:2207.11971
[6]   Boosting Co-teaching with Compression Regularization for Label Noise [J].
Chen, Yingyi ;
Shen, Xi ;
Hu, Shell Xu ;
Suykens, Johan A. K. .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, :2682-2686
[7]  
Chui K.T., 2023, Int. J. Intell. Netw., V4, P202
[8]  
Chui K.T., 2022, PROC INT C CYBER SEC, VVolume 599, P248, DOI 10.1007/978-3-031-22018-0_23
[9]   Enhancing Electrocardiogram Classification with Multiple Datasets and Distant Transfer Learning [J].
Chui, Kwok Tai ;
Gupta, Brij B. ;
Zhao, Mingbo ;
Malibari, Areej ;
Arya, Varsha ;
Alhalabi, Wadee ;
Ruiz, Miguel Torres .
BIOENGINEERING-BASEL, 2022, 9 (11)
[10]   Randaugment: Practical automated data augmentation with a reduced search space [J].
Cubuk, Ekin D. ;
Zoph, Barret ;
Shlens, Jonathon ;
Le, Quoc, V .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :3008-3017