Balanced Orthogonal Subspace Separation Detector for Few-Shot Object Detection in Aerial Imagery

被引:2
作者
Jiang, Hongxiang [1 ]
Wang, Qixiong [1 ]
Feng, Jiaqi [1 ]
Zhang, Guangyun [2 ]
Yin, Jihao [1 ]
机构
[1] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China
[2] Nanjing Tech Univ, Ctr Remote Sensing, Nanjing 300072, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Remote sensing; Detectors; Object detection; Training; Feature extraction; Metalearning; Task analysis; Adapter tuning; disentanglement representation; few-shot object detection (FSOD); orthogonal subspace learning; remote sensing images (RSIs);
D O I
10.1109/TGRS.2024.3423305
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Few-shot object detection (FSOD) in remote sensing images (RSIs) aims to achieve object location and classification with only a few training samples. Currently, mainstream transfer-learning methods employ a two-stage approach: pretraining on data-abundant base classes and fine-tuning on few-shot novel classes. However, existing approaches suffer notable degradation in both base and novel classes during fine-tuning, because of gradient conflict and class imbalance. To address this, we construct the balanced orthogonal subspace separation (BOSS) detector, a novel two-stage framework for FSOD. Specifically, to avoid contradictory gradients, BOSS distinctly isolates the training of base and novel classes at both structural and feature levels. For structural separation, a low-rank subspace adapter (LoSA) is introduced to ensure network optimization for novel classes without hampering base classes' pretraining performance, effectively addressing over-fitting in few-shot scenarios. For feature disentanglement, an orthogonal subspace extractor (OSE) is presented, enhancing class separability by learning class-specific, orthogonal basis-spanned subspace. Finally, a balanced classifier (BC) is proposed to equalize the imbalanced loss, with its dual-component design mitigating bias toward predicting background or base classes. Comparative evaluations on diverse remote sensing datasets demonstrate BOSS's superiority, outperforming state-of-the-art methods in mean average precision (mAP). These results underscore BOSS's effectiveness in FSOD, particularly in challenging remote sensing contexts.
引用
收藏
页数:17
相关论文
共 79 条
  • [61] GCWNet: A Global Context-Weaving Network for Object Detection in Remote Sensing Images
    Wu, Yulin
    Zhang, Ke
    Wang, Jingyu
    Wang, Yezi
    Wang, Qi
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [62] Oriented R-CNN for Object Detection
    Xie, Xingxing
    Cheng, Gong
    Wang, Jiabao
    Yao, Xiwen
    Han, Junwei
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3500 - 3509
  • [63] Xu YC, 2021, IEEE T PATTERN ANAL, V43, P1452, DOI [10.1109/TPAMI.2020.2974745, 10.1109/TGRS.2020.3026387]
  • [64] Meta R-CNN : Towards General Solver for Instance-level Low-shot Learning
    Yan, Xiaopeng
    Chen, Ziliang
    Xu, Anni
    Wang, Xiaoxi
    Liang, Xiaodan
    Lin, Liang
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9576 - 9585
  • [65] Orthogonality Loss: Learning Discriminative Representations for Face Recognition
    Yang, Shanming
    Deng, Weihong
    Wang, Mei
    Du, Junping
    Hu, Jiani
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2301 - 2314
  • [66] Yang X., 2021, P AAAI C ARTIFICIAL, P3163, DOI 10.1609/aaai.v35i4.16426
  • [67] Yang X, 2021, PR MACH LEARN RES, V139
  • [68] Yang Xue, 2021, Advances in Neural Information Processing Systems, V34
  • [69] RepPoints: Point Set Representation for Object Detection
    Yang, Ze
    Liu, Shaohui
    Hu, Han
    Wang, Liwei
    Lin, Stephen
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9656 - 9665
  • [70] Yeh SY, 2024, Arxiv, DOI arXiv:2309.14859