Class-Balanced Active Learning for Image Classification

被引:21
作者
Bengar, Javad Zolfaghari [1 ,2 ]
van de Weijer, Joost [1 ,2 ]
Fuentes, Laura Lopez [1 ]
Raducanu, Bogdan [1 ,2 ]
机构
[1] Comp Vis Ctr CVC, Barcelona, Spain
[2] Univ Autonoma Barcelona UAB, Barcelona, Spain
来源
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022) | 2022年
关键词
D O I
10.1109/WACV51458.2022.00376
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active learning aims to reduce the labeling effort that is required to train algorithms by learning an acquisition function selecting the most relevant data for which a label should be requested from a large unlabeled data pool. Active learning is generally studied on balanced datasets where an equal amount of images per class is available. However, real-world datasets suffer from severe imbalanced classes, the so called long-tail distribution. We argue that this further complicates the active learning process, since the unbalanced data pool can result in suboptimal classifiers. To address this problem in the context of active learning, we proposed a general optimization framework that explicitly takes class-balancing into account. Results on three datasets showed that the method is general (it can be combined with most existing active learning algorithms) and can be effectively applied to boost the performance of both informative and representative-based active learning methods. In addition, we showed that also on balanced datasets our method(1) generally results in a performance gain.
引用
收藏
页码:3707 / 3716
页数:10
相关论文
共 61 条
[21]  
Freytag A, 2014, LECT NOTES COMPUT SC, V8692, P562, DOI 10.1007/978-3-319-10593-2_37
[22]   Scalable Active Learning by Approximated Error Reduction [J].
Fu, Weijie ;
Wang, Meng ;
Hao, Shijie ;
Wu, Xindong .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :1396-1405
[23]  
Gal Y, 2017, PR MACH LEARN RES, V70
[24]   Active Transfer Learning with Zero-Shot Priors: Reusing Past Datasets for Future Tasks [J].
Gavves, E. ;
Mensink, T. ;
Tommasi, T. ;
Snoek, C. G. M. ;
Tuytelaars, T. .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2731-2739
[25]  
Golestaneh S.A., 2020, BMVC
[26]   Gaseous air pollution and emergency hospital visits for hypertension in Beijing, China: a time-stratified case-crossover study [J].
Guo, Yuming ;
Tong, Shilu ;
Li, Shanshan ;
Barnett, Adrian G. ;
Yu, Weiwei ;
Zhang, Yanshen ;
Pan, Xiaochuan .
ENVIRONMENTAL HEALTH, 2010, 9
[27]  
Gurobi Optimization L., 2024, Gurobi Optimizer Reference Manual
[28]   Learning from Imbalanced Data [J].
He, Haibo ;
Garcia, Edwardo A. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) :1263-1284
[29]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[30]   Learning Deep Representation for Imbalanced Classification [J].
Huang, Chen ;
Li, Yining ;
Loy, Chen Change ;
Tang, Xiaoou .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5375-5384