Global-Margin Uncertainty and Collaborative Sampling for Active Learning in Complex Aerial Images Object Detection

被引：0

作者：

Zhu, Dongjun ^{[1
]}

Gu, Chengjie ^{[1
,2
]}

Zhang, Junjun ^{[1
]}

Yao, Yuyou ^{[1
]}

Tan, Dayu ^{[3
]}

机构：

[1] Anhui Univ Sci & Technol, Sch Publ Secur & Emergency Management, Huainan 232001, Peoples R China

[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230027, Peoples R China

[3] Anhui Univ, Inst Phys Sci & Informat Technol, Key Lab Intelligent Comp & Signal Proc, Minist Educ, Hefei 230601, Peoples R China

来源：

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS | 2024年 / 21卷

关键词：

Uncertainty; Object detection; Entropy; Learning systems; Training; Collaboration; Annotations; Active learning; aerial images; collaborative sampling (CS); global-margin uncertainty (GMU); object detection;

D O I：

10.1109/LGRS.2024.3373038

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Object detection in aerial images based on deep learning requires a large amount of labeled data, whereas manual annotation of aerial images is time-consuming and laborious. As a branch of machine learning, active learning can help humans find valuable samples by designing some corresponding query strategies, which effectively reduces the cost of manual labeling. However, objects in aerial images are usually small, dense, and accompanied by interference from complex backgrounds. These bring considerable challenges for active learning in selecting high-value aerial image samples. Currently, there is a relative lack of study on active learning for aerial image object detection. Therefore, this letter proposes a novel active learning method, using global-margin uncertainty (GMU) and collaborative sampling (CS) to find out the highly valuable aerial image samples to reduce the annotation cost and improve the training efficiency of models. In GMU, the predicted scores of categories are applied to calculate the global uncertainty and margin uncertainty of unlabeled aerial images and then those aerial images with high uncertainty scores are selected as the candidate samples. In CS, we train a main model and an auxiliary model, respectively, to detect the candidate samples, where the samples with large differences in detection results of the two models are selected for manual annotation. The experiments conducted on VisDrone2019 and DOTA-v1.5 datasets show that the proposed method has a better performance compared with several state-of-the-art active learning methods.

引用

页数：5

共 27 条

[1] Agarwal Sharat, 2020, Computer Vision - ECCV 2020 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12361), P137, DOI 10.1007/978-3-030-58517-4_9
[2] Aggarwal CC, 2014, CH CRC DATA MIN KNOW, P457
[3] A survey on active learning and human-in-the-loop deep learning for medical image analysis
Budd, Samuel
Robinson, Emma C.
Kainz, Bernhard
[J]. MEDICAL IMAGE ANALYSIS, 2021, 71
[4] Dual-Aligned Oriented Detector
Cheng, Gong
Yao, Yanqing
Li, Shengyang
Li, Ke
Xie, Xingxing
Wang, Jiabao
Yao, Xiwen
Han, Junwei
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[5] Towards Fine-grained Sampling for Active Learning in Object Detection
Desai, Sai Vikas
Balasubramanian, Vineeth N.
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4010 - 4014
[6] Active Learning-Driven Siamese Network for Hyperspectral Image Classification
Di, Xiyao
Xue, Zhaohui
Zhang, Mengxue
[J]. REMOTE SENSING, 2023, 15 (03)
[7] The Pascal Visual Object Classes (VOC) Challenge
Everingham, Mark
Van Gool, Luc
Williams, Christopher K. I.
Winn, John
Zisserman, Andrew
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
[8] Hacohen G, 2022, PR MACH LEARN RES
[9] Align Deep Features for Oriented Object Detection
Han, Jiaming
Ding, Jian
Li, Jie
Xia, Gui-Song
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[10] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778

← 1 2 3 →