Bias oriented unbiased data augmentation for cross-bias representation learning

Cited by: 0
Authors
Li, Lei [1, 2]
Tang, Fan [1, 2]
Cao, Juan [1, 2]
Li, Xirong [3]
Wang, Danding [1, 2]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Renmin Univ China, Beijing 100872, Peoples R China
Keywords
Cross-bias generalization; Data augmentation; Unbiased representation
DOI
10.1007/s00530-022-01013-6
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Biased cues in the training data may build strong connections between specific targets and unexpected concepts, so that the learned representations cannot be applied to real-world data that does not contain the same biased cues. To learn cross-bias representations that generalize to unbiased datasets using only biased data, researchers have focused on reducing the influence of biased cues through unbiased sampling or augmentation based on human experience. However, these methods neglect the distribution of biased cues in the dataset, which limits their performance. In this paper, we propose a bias-oriented data augmentation that enhances cross-bias generalization by enlarging "safety" and "unbiasedness" constraints in the training data without manual prior intervention. The safety constraint maintains the class-specific information during augmentation, while the unbiasedness constraint reduces the statistical correlation between bias information and class labels. Experiments under different biased proportions on four synthetic and real-world datasets show that the proposed approach can improve the performance of other SOTA debiasing approaches (colored MNIST: 0.35-26.14%, corrupted CIFAR10: 3.14-8.44%, BFFHQ: 1.50%, and BAR: 1.72%).
Pages: 725-738
Number of pages: 14