Balanced Orthogonal Subspace Separation Detector for Few-Shot Object Detection in Aerial Imagery

被引:2
作者
Jiang, Hongxiang [1 ]
Wang, Qixiong [1 ]
Feng, Jiaqi [1 ]
Zhang, Guangyun [2 ]
Yin, Jihao [1 ]
机构
[1] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China
[2] Nanjing Tech Univ, Ctr Remote Sensing, Nanjing 300072, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Remote sensing; Detectors; Object detection; Training; Feature extraction; Metalearning; Task analysis; Adapter tuning; disentanglement representation; few-shot object detection (FSOD); orthogonal subspace learning; remote sensing images (RSIs);
D O I
10.1109/TGRS.2024.3423305
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Few-shot object detection (FSOD) in remote sensing images (RSIs) aims to achieve object location and classification with only a few training samples. Currently, mainstream transfer-learning methods employ a two-stage approach: pretraining on data-abundant base classes and fine-tuning on few-shot novel classes. However, existing approaches suffer notable degradation in both base and novel classes during fine-tuning, because of gradient conflict and class imbalance. To address this, we construct the balanced orthogonal subspace separation (BOSS) detector, a novel two-stage framework for FSOD. Specifically, to avoid contradictory gradients, BOSS distinctly isolates the training of base and novel classes at both structural and feature levels. For structural separation, a low-rank subspace adapter (LoSA) is introduced to ensure network optimization for novel classes without hampering base classes' pretraining performance, effectively addressing over-fitting in few-shot scenarios. For feature disentanglement, an orthogonal subspace extractor (OSE) is presented, enhancing class separability by learning class-specific, orthogonal basis-spanned subspace. Finally, a balanced classifier (BC) is proposed to equalize the imbalanced loss, with its dual-component design mitigating bias toward predicting background or base classes. Comparative evaluations on diverse remote sensing datasets demonstrate BOSS's superiority, outperforming state-of-the-art methods in mean average precision (mAP). These results underscore BOSS's effectiveness in FSOD, particularly in challenging remote sensing contexts.
引用
收藏
页数:17
相关论文
共 79 条
  • [21] He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
  • [22] Houlsby N, 2019, PR MACH LEARN RES, V97
  • [23] Hu E. J., 2022, PROC INT C LEARN REP, P1
  • [24] Relation Networks for Object Detection
    Hu, Han
    Gu, Jiayuan
    Zhang, Zheng
    Dai, Jifeng
    Wei, Yichen
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3588 - 3597
  • [25] Few-Shot Object Detection on Remote Sensing Images via Shared Attention Module and Balanced Fine-Tuning Strategy
    Huang, Xu
    He, Bokun
    Tong, Ming
    Wang, Dingwen
    He, Chu
    [J]. REMOTE SENSING, 2021, 13 (19)
  • [26] Lebedev V., 2015, PROC INT C LEARN REP, P1
  • [27] MM-RCNN: Toward Few-Shot Object Detection in Remote Sensing Images With Meta Memory
    Li, Jianxiang
    Tian, Yan
    Xu, Yiping
    Hu, Xinli
    Zhang, Zili
    Wang, Hu
    Xiao, Yiwen
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [28] Object detection in optical remote sensing images: A survey and a new benchmark
    Li, Ke
    Wan, Gang
    Cheng, Gong
    Meng, Liqiu
    Han, Junwei
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 159 (159) : 296 - 307
  • [29] Li LZ, 2023, IEEE T GEOSCI REMOTE, V61, DOI [10.1109/TGRS.2023.3247642, 10.1109/TGRS.2023.3300071]
  • [30] Oriented RepPoints for Aerial Object Detection
    Li, Wentong
    Chen, Yijie
    Hu, Kaixuan
    Zhu, Jianke
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1819 - 1828