Sample Balancing for Deep Learning-Based Visual Recognition

被引:15
作者
Chen, Xin [1 ]
Weng, Jian [1 ]
Luo, Weiqi [1 ]
Lu, Wei [2 ]
Wu, Huimin [1 ]
Xu, Jiaming [3 ]
Tian, Qi [4 ]
机构
[1] Jinan Univ, Coll Informat Sci & Technol, Guangzhou 510632, Peoples R China
[2] Sun Yat Sen Univ, Guangdong Key Lab Informat Secur Technol, Minist Educ, Key Lab Machine Intelligence & Adv Comp,Sch Data, Guangzhou 510006, Peoples R China
[3] Baiyun Dist Bur Justice, Guangzhou 510405, Peoples R China
[4] Huawei, Noahs Ark Lab, Shenzhen 518129, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Training; Visualization; Semantics; Deep learning; Measurement; Task analysis; Image recognition; image classification; sample reweighting; sample selection; self-paced learning; ALGORITHMS; FEATURES; OUTLIERS;
D O I
10.1109/TNNLS.2019.2947789
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sample balancing includes sample selection and sample reweighting. Sample selection aims to remove some bad samples that may lead to bad local optima. Sample reweighting aims to assign optimal weights to samples to improve performance. In this article, we integrate a sample selection method based on self-paced learning into deep learning frameworks and study the influence of different sample selection strategies on training deep networks. In addition, most of the existing sample reweighting methods mainly take per-class sample number as a metric, which does not fully consider sample qualities. To improve the performance, we propose a novel metric based on the multiview semantic encoders to reweight the samples more appropriately. Then, we propose an optimization mechanism to embed sample weights into loss functions of deep networks, which can be trained in end-to-end manners. We conduct experiments on the CIFAR data set and the ImageNet data set. The experimental results demonstrate that our proposed sample balancing method can improve the performances of deep learning methods in several visual recognition tasks.
引用
收藏
页码:3962 / 3976
页数:15
相关论文
共 83 条
[1]   Deep learning approach for active classification of electrocardiogram signals [J].
Al Rahhal, M. M. ;
Bazi, Yakoub ;
AlHichri, Haikel ;
Alajlan, Naif ;
Melgani, Farid ;
Yager, R. R. .
INFORMATION SCIENCES, 2016, 345 :340-354
[2]  
Angiulli F., 2002, Principles of Data Mining and Knowledge Discovery. 6th European Conference, PKDD 2002. Proceedings (Lecture Notes in Artificial Intelligence Vol.2431), P15
[3]  
[Anonymous], 2008, P 14 ACM SIGKDD INT
[4]  
[Anonymous], 2009, COMPUT SCI TECH REP
[5]  
[Anonymous], 1994, Outliers in statistical data
[6]  
[Anonymous], IEEE T PATTERN ANAL
[7]  
[Anonymous], 2016, ARXIV PREPRINT ARXIV
[8]  
[Anonymous], 2015, Tiny ImageNet Visual Recognition Challenge., DOI DOI 10.1109/ICCV.2015.123
[9]  
[Anonymous], 2006, DATA MINING CONCEPTS
[10]  
[Anonymous], 2005, KDD