Balanced Orthogonal Subspace Separation Detector for Few-Shot Object Detection in Aerial Imagery

被引:2
作者
Jiang, Hongxiang [1 ]
Wang, Qixiong [1 ]
Feng, Jiaqi [1 ]
Zhang, Guangyun [2 ]
Yin, Jihao [1 ]
机构
[1] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China
[2] Nanjing Tech Univ, Ctr Remote Sensing, Nanjing 300072, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Remote sensing; Detectors; Object detection; Training; Feature extraction; Metalearning; Task analysis; Adapter tuning; disentanglement representation; few-shot object detection (FSOD); orthogonal subspace learning; remote sensing images (RSIs);
D O I
10.1109/TGRS.2024.3423305
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Few-shot object detection (FSOD) in remote sensing images (RSIs) aims to achieve object location and classification with only a few training samples. Currently, mainstream transfer-learning methods employ a two-stage approach: pretraining on data-abundant base classes and fine-tuning on few-shot novel classes. However, existing approaches suffer notable degradation in both base and novel classes during fine-tuning, because of gradient conflict and class imbalance. To address this, we construct the balanced orthogonal subspace separation (BOSS) detector, a novel two-stage framework for FSOD. Specifically, to avoid contradictory gradients, BOSS distinctly isolates the training of base and novel classes at both structural and feature levels. For structural separation, a low-rank subspace adapter (LoSA) is introduced to ensure network optimization for novel classes without hampering base classes' pretraining performance, effectively addressing over-fitting in few-shot scenarios. For feature disentanglement, an orthogonal subspace extractor (OSE) is presented, enhancing class separability by learning class-specific, orthogonal basis-spanned subspace. Finally, a balanced classifier (BC) is proposed to equalize the imbalanced loss, with its dual-component design mitigating bias toward predicting background or base classes. Comparative evaluations on diverse remote sensing datasets demonstrate BOSS's superiority, outperforming state-of-the-art methods in mean average precision (mAP). These results underscore BOSS's effectiveness in FSOD, particularly in challenging remote sensing contexts.
引用
收藏
页数:17
相关论文
共 79 条
  • [41] Libra R-CNN: Towards Balanced Learning for Object Detection
    Pang, Jiangmiao
    Chen, Kai
    Shi, Jianping
    Feng, Huajun
    Ouyang, Wanli
    Lin, Dahua
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 821 - 830
  • [42] Pfeiffer J, 2021, 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), P487
  • [43] Stacked Autoencoders Driven by Semi-Supervised Learning for Building Extraction from near Infrared Remote Sensing Imagery
    Protopapadakis, Eftychios
    Doulamis, Anastasios
    Doulamis, Nikolaos
    Maltezos, Evangelos
    [J]. REMOTE SENSING, 2021, 13 (03) : 1 - 24
  • [44] DeFRCN: Decoupled Faster R-CNN for Few-Shot Object Detection
    Qiao, Limeng
    Zhao, Yuxuan
    Li, Zhiyuan
    Qiu, Xi
    Wu, Jianan
    Zhang, Chi
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8661 - 8670
  • [45] Radford A, 2021, PR MACH LEARN RES, V139
  • [46] Orthogonal Projection Loss
    Ranasinghe, Kanchana
    Naseer, Muzammal
    Hayat, Munawar
    Khan, Salman
    Khan, Fahad Shahbaz
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12313 - 12323
  • [47] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
    Ren, Shaoqing
    He, Kaiming
    Girshick, Ross
    Sun, Jian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) : 1137 - 1149
  • [48] Rücklé A, 2021, 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), P7930
  • [49] Adaptive Subspaces for Few-Shot Learning
    Simon, Christian
    Koniusz, Piotr
    Nock, Richard
    Harandi, Mehrtash
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4135 - 4144
  • [50] FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding
    Sun, Bo
    Li, Banghuai
    Cai, Shengcai
    Yuan, Ye
    Zhang, Chi
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7348 - 7358