Remix: Rebalanced Mixup

被引:182
作者
Chou, Hsin-Ping [1 ]
Chang, Shih-Chieh [1 ]
Pan, Jia-Yu [2 ]
Wei, Wei [2 ]
Juan, Da-Cheng [2 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
[2] Google Res, Mountain View, CA USA
来源
COMPUTER VISION - ECCV 2020 WORKSHOPS, PT VI | 2020年 / 12540卷
关键词
Imbalanced data; Mixup; Regularization; Image classification;
D O I
10.1007/978-3-030-65414-6_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep image classifiers often perform poorly when training data are heavily class-imbalanced. In this work, we propose a new regularization technique, Remix, that relaxes Mixup's formulation and enables the mixing factors of features and labels to be disentangled. Specifically, when mixing two samples, while features are mixed in the same fashion as Mixup, Remix assigns the label in favor of the minority class by providing a disproportionately higher weight to the minority class. By doing so, the classifier learns to push the decision boundaries towards the majority classes and balance the generalization error between majority and minority classes. We have studied the state-of-the art regularization techniques such as Mixup, Manifold Mixup and CutMix under class-imbalanced regime, and shown that the proposed Remix significantly outperforms these state-of-the-arts and several re-weighting and re-sampling techniques, on the imbalanced datasets constructed by CIFAR-10, CIFAR-100, and CINIC-10. We have also evaluated Remix on a real-world large-scale imbalanced dataset, iNaturalist 2018. The experimental results confirmed that Remix provides consistent and significant improvements over the previous methods.
引用
收藏
页码:95 / 110
页数:16
相关论文
共 38 条
[1]  
Berthelot D, 2019, ADV NEUR IN, V32
[2]   Loss Max-Pooling for Semantic Image Segmentation [J].
Bulo, Samuel Rota ;
Neuhold, Gerhard ;
Kontschieder, Peter .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :7082-7091
[3]  
Byrd J., 2018, ICML
[4]  
Cao KD, 2019, ADV NEUR IN, V32
[5]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[6]  
Chung Y.A., 2015, IJCAI
[7]   Class-Balanced Loss Based on Effective Number of Samples [J].
Cui, Yin ;
Jia, Menglin ;
Lin, Tsung-Yi ;
Song, Yang ;
Belongie, Serge .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9260-9269
[8]  
Darlow L. N., 2018, arXiv
[9]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[10]  
DeVries T, 2017, Arxiv, DOI arXiv:1708.04552