Balanced Orthogonal Subspace Separation Detector for Few-Shot Object Detection in Aerial Imagery

被引:2
作者
Jiang, Hongxiang [1 ]
Wang, Qixiong [1 ]
Feng, Jiaqi [1 ]
Zhang, Guangyun [2 ]
Yin, Jihao [1 ]
机构
[1] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China
[2] Nanjing Tech Univ, Ctr Remote Sensing, Nanjing 300072, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Remote sensing; Detectors; Object detection; Training; Feature extraction; Metalearning; Task analysis; Adapter tuning; disentanglement representation; few-shot object detection (FSOD); orthogonal subspace learning; remote sensing images (RSIs);
D O I
10.1109/TGRS.2024.3423305
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Few-shot object detection (FSOD) in remote sensing images (RSIs) aims to achieve object location and classification with only a few training samples. Currently, mainstream transfer-learning methods employ a two-stage approach: pretraining on data-abundant base classes and fine-tuning on few-shot novel classes. However, existing approaches suffer notable degradation in both base and novel classes during fine-tuning, because of gradient conflict and class imbalance. To address this, we construct the balanced orthogonal subspace separation (BOSS) detector, a novel two-stage framework for FSOD. Specifically, to avoid contradictory gradients, BOSS distinctly isolates the training of base and novel classes at both structural and feature levels. For structural separation, a low-rank subspace adapter (LoSA) is introduced to ensure network optimization for novel classes without hampering base classes' pretraining performance, effectively addressing over-fitting in few-shot scenarios. For feature disentanglement, an orthogonal subspace extractor (OSE) is presented, enhancing class separability by learning class-specific, orthogonal basis-spanned subspace. Finally, a balanced classifier (BC) is proposed to equalize the imbalanced loss, with its dual-component design mitigating bias toward predicting background or base classes. Comparative evaluations on diverse remote sensing datasets demonstrate BOSS's superiority, outperforming state-of-the-art methods in mean average precision (mAP). These results underscore BOSS's effectiveness in FSOD, particularly in challenging remote sensing contexts.
引用
收藏
页数:17
相关论文
共 79 条
  • [71] Zeng Y, 2024, Arxiv, DOI arXiv:2303.04989
  • [72] Meta-DETR: Image-Level Few-Shot Detection With Inter-Class Correlation Exploitation
    Zhang, Gongjie
    Luo, Zhipeng
    Cui, Kaiwen
    Lu, Shijian
    Xing, Eric P.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 12832 - 12843
  • [73] Zhang H., 2023, PROC INT C LEARN REP, P1
  • [74] Zhang SF, 2020, PROC CVPR IEEE, P9756, DOI 10.1109/CVPR42600.2020.00978
  • [75] Few-Shot Object Detection With Self-Adaptive Global Similarity and Two-Way Foreground Stimulator in Remote Sensing Images
    Zhang, Yuchen
    Zhang, Bo
    Wang, Bin
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 7263 - 7276
  • [76] An Improved Aggregated-Mosaic Method for the Sparse Object Detection of Remote Sensing Imagery
    Zhao, Boya
    Wu, Yuanfeng
    Guan, Xinran
    Gao, Lianru
    Zhang, Bing
    [J]. REMOTE SENSING, 2021, 13 (13)
  • [77] Few-Shot Object Detection of Remote Sensing Images via Two-Stage Fine-Tuning
    Zhao, Zhitao
    Tang, Ping
    Zhao, Lijun
    Zhang, Zheng
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [78] Few-Shot Object Detection via Context-Aware Aggregation for Remote Sensing Images
    Zhou, Yong
    Hu, Han
    Zhao, Jiaqi
    Zhu, Hancheng
    Yao, Rui
    Du, Wen-Liang
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [79] Zhu Xizhou, 2021, ICLR, DOI 10.48550/arXiv.2010.04159