Imbalanced Deep Learning by Minority Class Incremental Rectification

被引：247

作者：

Dong, Qi ^{[1
]}

Gong, Shaogang ^{[1
]}

Zhu, Xiatian ^{[2
]}

机构：

[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England

[2] Vis Semant Ltd, London E1 4NS, England

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2019年 / 41卷 / 06期

基金：

“创新英国”项目;

关键词：

Class imbalanced deep learning; multi-label learning; inter-class boundary rectification; hard sample mining; facial attribute recognition; clothing attribute recognition; person attribute recognition; SUPPORT VECTOR MACHINES; NEURAL-NETWORKS; CLASSIFICATION; SMOTE;

D O I：

10.1109/TPAMI.2018.2832629

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Model learning from class imbalanced training data is a long-standing and significant challenge for machine learning. In particular, existing deep learning methods consider mostly either class balanced data or moderately imbalanced data in model training, and ignore the challenge of learning from significantly imbalanced training data. To address this problem, we formulate a class imbalanced deep learning model based on batch-wise incremental minority (sparsely sampled) class rectification by hard sample mining in majority (frequently sampled) classes during model training. This model is designed to minimise the dominant effect of majority classes by discovering sparsely sampled boundaries of minority classes in an iterative batch-wise learning process. To that end, we introduce a Class Rectification Loss (CRL) function that can be deployed readily in deep network architectures. Extensive experimental evaluations are conducted on three imbalanced person attribute benchmark datasets (CelebA, X-Domain, DeepFashion) and one balanced object category benchmark dataset (CIFAR-100). These experimental results demonstrate the performance advantages and model scalability of the proposed batch-wise incremental minority class rectification model over the existing state-of-the-art models for addressing the problem of imbalanced data learning.

引用

页码：1367 / 1381

页数：15

共 83 条

[1] Applying support vector machines to imbalanced datasets [J].

Akbani, R ;

Kwek, S ;

Japkowicz, N .

MACHINE LEARNING: ECML 2004, PROCEEDINGS, 2004, 3201 :39-50

[2]

Alejo R, 2006, LECT NOTES COMPUT SC, V4224, P464

[3]

Ando RK, 2005, J MACH LEARN RES, V6, P1817

[4]

[Anonymous], 2015, P INT C MACH LEARN

[5]

[Anonymous], PROC CVPR IEEE

[6]

[Anonymous], P INT C MULT RETR IC

[7]

[Anonymous], 2007, CALTECH 256 OBJECT C

[8]

[Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386

[9]

[Anonymous], 2015, P 3 INT C LEARN REPR

[10]

[Anonymous], P IEEE C COMP VIS PA

← 1 2 3 4 5 6 7 8 9 →