Correlated Input-Dependent Label Noise in Large-Scale Image Classification

被引:28
作者
Collier, Mark [1 ]
Mustafa, Basil [1 ]
Kokiopoulou, Efi [1 ]
Jenatton, Rodolphe [1 ]
Berent, Jesse [1 ]
机构
[1] Google AI, Mountain View, CA 94043 USA
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
D O I
10.1109/CVPR46437.2021.00160
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large scale image classification datasets often contain noisy labels. We take a principled probabilistic approach to modelling input-dependent, also known as heteroscedastic, label noise in these datasets. We place a multivariate Nor-mal distributed latent variable on the final hidden layer of a neural network classifier. The covariance matrix of this latent variable, models the aleatoric uncertainty due to label noise. We demonstrate that the learned covariance structure captures known sources of label noise between semantically similar and co-occurring classes. Compared to standard neural network training and other baselines, we show significantly improved accuracy on Imagenet ILSVRC 2012 79.3% (+ 2.6%), Imagenet-21k 47.0% (+ 1.1%) and JFT 64.7% (+ 1.6%). We set a new state-of-the-art result on WebVision 1.0 with 76.6% top-1 accuracy. These datasets range from over 1M to over 300M training examples and from 1k classes to more than 21k classes. Our method is simple to use, and we provide an implementation that is a drop-in replacement for the final fully-connected layer in a deep classifier.
引用
收藏
页码:1551 / 1560
页数:10
相关论文
共 47 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], 2018, PR MACH LEARN RES
[3]  
[Anonymous], 2017, INT C MACH LEARN, DOI DOI 10.1109/DSC.2017.89
[4]  
Bishop CM, 1997, ADV NEUR IN, V9, P347
[5]  
Blundell C, 2015, PR MACH LEARN RES, V37, P1613
[6]  
Bolduc Moshe Ben-AkiWand Denis, 1996, TECHNICAL REPORT
[7]  
Cao K., 2020, ARXIV200615766
[8]  
Collier Mark, ARXIV PREPRINT ARXIV
[9]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10]  
Djolonga Josip, 2020, ARXIV200708558