SplitNet: Learnable Clean-Noisy Label Splitting for Learning with Noisy Labels

被引:4
作者
Kim, Daehwan [1 ]
Ryoo, Kwangrok [2 ,3 ]
Cho, Hansang [1 ]
Kim, Seungryong [4 ]
机构
[1] Samsung Electromech, 150 Meyoung Ro, Suwon, Gyeonggi, South Korea
[2] Korea Univ, 145 Anam Ro, Seoul, South Korea
[3] LG AI Res, 128 Yeoui Daero, Seoul, South Korea
[4] Korea Adv Inst Sci & Technol, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Deep learning; Learning with noisy labels; Semi-supervised learning; Clean-noisy label splitting;
D O I
10.1007/s11263-024-02187-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Annotating the dataset with high-quality labels is crucial for deep networks' performance, but in real-world scenarios, the labels are often contaminated by noise. To address this, some methods were recently proposed to automatically split clean and noisy labels among training data, and learn a semi-supervised learner in a Learning with Noisy Labels (LNL) framework. However, they leverage a handcrafted module for clean-noisy label splitting, which induces a confirmation bias in the semi-supervised learning phase and limits the performance. In this paper, for the first time, we present a learnable module for clean-noisy label splitting, dubbed SplitNet, and a novel LNL framework which complementarily trains the SplitNet and main network for the LNL task. We also propose to use a dynamic threshold based on split confidence by SplitNet to optimize the semi-supervised learner better. To enhance SplitNet training, we further present a risk hedging method. Our proposed method performs at a state-of-the-art level, especially in high noise ratio settings on various LNL benchmarks.
引用
收藏
页码:549 / 566
页数:18
相关论文
共 79 条
[1]  
Agarap A F., Deep Learning using Rectified Linear Units, DOI DOI 10.48550/ARXIV.1803.08375
[2]  
[Anonymous], 2021, CIFAR-N
[3]  
[Anonymous], 2018, Food-101N
[4]  
[Anonymous], 2021, CIFAR-IDN
[5]  
Arazo E, 2019, PR MACH LEARN RES, V97
[6]  
Arpit D, 2017, PR MACH LEARN RES, V70
[7]  
Bai Yingbin, 2021, Advances in Neural Information Processing Systems
[8]  
Berthelot D, 2019, ADV NEUR IN, V32
[9]  
Berthelot David., 2019, arXiv
[10]  
Chen MC, 2022, Arxiv, DOI arXiv:2112.02960