Global-Margin Uncertainty and Collaborative Sampling for Active Learning in Complex Aerial Images Object Detection

被引:0
作者
Zhu, Dongjun [1 ]
Gu, Chengjie [1 ,2 ]
Zhang, Junjun [1 ]
Yao, Yuyou [1 ]
Tan, Dayu [3 ]
机构
[1] Anhui Univ Sci & Technol, Sch Publ Secur & Emergency Management, Huainan 232001, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230027, Peoples R China
[3] Anhui Univ, Inst Phys Sci & Informat Technol, Key Lab Intelligent Comp & Signal Proc, Minist Educ, Hefei 230601, Peoples R China
关键词
Uncertainty; Object detection; Entropy; Learning systems; Training; Collaboration; Annotations; Active learning; aerial images; collaborative sampling (CS); global-margin uncertainty (GMU); object detection;
D O I
10.1109/LGRS.2024.3373038
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Object detection in aerial images based on deep learning requires a large amount of labeled data, whereas manual annotation of aerial images is time-consuming and laborious. As a branch of machine learning, active learning can help humans find valuable samples by designing some corresponding query strategies, which effectively reduces the cost of manual labeling. However, objects in aerial images are usually small, dense, and accompanied by interference from complex backgrounds. These bring considerable challenges for active learning in selecting high-value aerial image samples. Currently, there is a relative lack of study on active learning for aerial image object detection. Therefore, this letter proposes a novel active learning method, using global-margin uncertainty (GMU) and collaborative sampling (CS) to find out the highly valuable aerial image samples to reduce the annotation cost and improve the training efficiency of models. In GMU, the predicted scores of categories are applied to calculate the global uncertainty and margin uncertainty of unlabeled aerial images and then those aerial images with high uncertainty scores are selected as the candidate samples. In CS, we train a main model and an auxiliary model, respectively, to detect the candidate samples, where the samples with large differences in detection results of the two models are selected for manual annotation. The experiments conducted on VisDrone2019 and DOTA-v1.5 datasets show that the proposed method has a better performance compared with several state-of-the-art active learning methods.
引用
收藏
页数:5
相关论文
共 27 条
  • [1] Agarwal Sharat, 2020, Computer Vision - ECCV 2020 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12361), P137, DOI 10.1007/978-3-030-58517-4_9
  • [2] Aggarwal CC, 2014, CH CRC DATA MIN KNOW, P457
  • [3] A survey on active learning and human-in-the-loop deep learning for medical image analysis
    Budd, Samuel
    Robinson, Emma C.
    Kainz, Bernhard
    [J]. MEDICAL IMAGE ANALYSIS, 2021, 71
  • [4] Dual-Aligned Oriented Detector
    Cheng, Gong
    Yao, Yanqing
    Li, Shengyang
    Li, Ke
    Xie, Xingxing
    Wang, Jiabao
    Yao, Xiwen
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [5] Towards Fine-grained Sampling for Active Learning in Object Detection
    Desai, Sai Vikas
    Balasubramanian, Vineeth N.
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4010 - 4014
  • [6] Active Learning-Driven Siamese Network for Hyperspectral Image Classification
    Di, Xiyao
    Xue, Zhaohui
    Zhang, Mengxue
    [J]. REMOTE SENSING, 2023, 15 (03)
  • [7] The Pascal Visual Object Classes (VOC) Challenge
    Everingham, Mark
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
  • [8] Hacohen G, 2022, PR MACH LEARN RES
  • [9] Align Deep Features for Oriented Object Detection
    Han, Jiaming
    Ding, Jian
    Li, Jie
    Xia, Gui-Song
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [10] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778