Feature Space Augmentation for Long-Tailed Data

被引：169

作者：

Chu, Peng ^{[1
]}

Bian, Xiao ^{[2
]}

Liu, Shaopeng ^{[3
]}

Ling, Haibin ^{[1
,4
]}

机构：

[1] Temple Univ, Philadelphia, PA USA

[2] Google Inc, Mountain View, CA USA

[3] GE Res, Niskayuna, NY USA

[4] SUNY Stony Brook, Stony Brook, NY USA

来源：

COMPUTER VISION - ECCV 2020, PT XXIX | 2020年 / 12374卷

关键词：

D O I：

10.1007/978-3-030-58526-6_41

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Real-world data often follow a long-tailed distribution as the frequency of each class is typically different. For example, a dataset can have a large number of under-represented classes and a few classes with more than sufficient data. However, a model to represent the dataset is usually expected to have reasonably homogeneous performances across classes. Introducing class-balanced loss and advanced methods on data re-sampling and augmentation are among the best practices to alleviate the data imbalance problem. However, the other part of the problem about the under-represented classes will have to rely on additional knowledge to recover the missing information. In this work, we present a novel approach to address the long-tailed problem by augmenting the under-represented classes in the feature space with the features learned from the classes with ample samples. In particular, we decompose the features of each class into a class-generic component and a class-specific component using class activation maps. Novel samples of under-represented classes are then generated on the fly during training stages by fusing the class-specific features from the under-represented classes with the class-generic features from confusing classes. Our results on different datasets such as iNaturalist, ImageNetLT, Places-LT and a long-tailed version of CIFAR have shown the state of the art performances.

引用

页码：694 / 710

页数：17

共 50 条

[1]

Akata Z, 2015, PROC CVPR IEEE, P2927, DOI 10.1109/CVPR.2015.7298911

[2] Sharing Representations for Long Tail Computer Vision Problems [J].

Bengio, Samy .

ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2015, :1-1

[3]

Bian X, 2016, IEEE WINT CONF APPL

[4] A systematic study of the class imbalance problem in convolutional neural networks [J].

Buda, Mateusz ;

Maki, Atsuto ;

Mazurowski, Maciej A. .

NEURAL NETWORKS, 2018, 106 :249-259

[5] SMOTE: Synthetic minority over-sampling technique [J].

Chawla, Nitesh V. ;

Bowyer, Kevin W. ;

Hall, Lawrence O. ;

Kegelmeyer, W. Philip .

2002, American Association for Artificial Intelligence (16)

[6] DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving [J].

Chen, Chenyi ;

Seff, Ari ;

Kornhauser, Alain ;

Xiao, Jianxiong .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2722-2730

[7] Destruction and Construction Learning for Fine-grained Image Recognition [J].

Chen, Yue ;

Bai, Yalong ;

Zhang, Wei ;

Mei, Tao .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5152-5161

[8]

Chen ZT, 2019, AAAI CONF ARTIF INTE, P3379

[9] Class-Balanced Loss Based on Effective Number of Samples [J].

Cui, Yin ;

Jia, Menglin ;

Lin, Tsung-Yi ;

Song, Yang ;

Belongie, Serge .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9260-9269

[10]

Drummond C, 2003, WORKSH LEARN IMB DAT, V11, P1

← 1 2 3 4 5 →