Unlabeled data selection for active learning in image classification

被引:11
作者
Li, Xiongquan [1 ]
Wang, Xukang [2 ]
Chen, Xuhesheng [3 ]
Lu, Yao [4 ]
Fu, Hongpeng [5 ]
Wu, Ying Cheng [6 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Peoples R China
[2] Sage IT Consulting Grp, Shanghai, Peoples R China
[3] Univ N Carolina, Chapel Hill, NC USA
[4] Univ Bristol, Bristol, England
[5] Northeastern Univ, Khoury Coll Comp Sci, Seattle, WA USA
[6] Univ Washington, Seattle, WA USA
基金
英国科研创新办公室;
关键词
D O I
10.1038/s41598-023-50598-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Active Learning has emerged as a viable solution for addressing the challenge of labeling extensive amounts of data in data-intensive applications such as computer vision and neural machine translation. The main objective of Active Learning is to automatically identify a subset of unlabeled data samples for annotation. This identification process is based on an acquisition function that assesses the value of each sample for model training. In the context of computer vision, image classification is a crucial task that typically requires a substantial training dataset. This research paper introduces innovative selection methods within the Active Learning framework, aiming to identify informative images from unlabeled datasets while minimizing the number of required training data. The proposed methods, namely Similari-ty-based Selection, Prediction Probability-based Selection, and Competence-based Active Learning, have been extensively evaluated through experiments conducted on popular datasets like Cifar10 and Cifar100. The experimental results demonstrate that the proposed methods outperform random selection and conventional selection techniques. The superior performance of the novel selection methods underscores their effectiveness in enhancing the Active Learning process for image classification tasks.
引用
收藏
页数:13
相关论文
共 33 条
[1]  
Alireza Golestaneh S., 2020, arXiv
[2]  
Ash Jordan T., 2020, Deep batch active learning by diverse, uncertain gradient lower bounds
[3]   The power of ensembles for active learning in image classification [J].
Beluch, William H. ;
Genewein, Tim ;
Nuernberger, Andreas ;
Koehler, Jan M. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9368-9377
[4]   Active learning for regression by inverse distance weighting [J].
Bemporad, Alberto .
INFORMATION SCIENCES, 2023, 626 :275-292
[5]  
Bengar J. Z., 2021, arXiv
[6]   Reducing Label Effort: Self-Supervised meets Active Learning [J].
Bengar, Javad Zolfaghari ;
van de Weijer, Joost ;
Twardowski, Bartlomiej ;
Raducanu, Bogdan .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, :1631-1639
[7]  
Bengar Javad Zolfaghari, 2019, arXiv
[8]  
Biyik E, 2019, Arxiv, DOI arXiv:1906.07975
[9]   Hyperspectral Image Classification Promotion Using Clustering Inspired Active Learning [J].
Ding, Chen ;
Zheng, Mengmeng ;
Chen, Feixiong ;
Zhang, Yuankun ;
Zhuang, Xusi ;
Fan, Enquan ;
Wen, Dushi ;
Zhang, Lei ;
Wei, Wei ;
Zhang, Yanning .
REMOTE SENSING, 2022, 14 (03)
[10]  
Gal Y., 2017, ARXIV