Balanced Orthogonal Subspace Separation Detector for Few-Shot Object Detection in Aerial Imagery

被引:2
作者
Jiang, Hongxiang [1 ]
Wang, Qixiong [1 ]
Feng, Jiaqi [1 ]
Zhang, Guangyun [2 ]
Yin, Jihao [1 ]
机构
[1] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China
[2] Nanjing Tech Univ, Ctr Remote Sensing, Nanjing 300072, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Remote sensing; Detectors; Object detection; Training; Feature extraction; Metalearning; Task analysis; Adapter tuning; disentanglement representation; few-shot object detection (FSOD); orthogonal subspace learning; remote sensing images (RSIs);
D O I
10.1109/TGRS.2024.3423305
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Few-shot object detection (FSOD) in remote sensing images (RSIs) aims to achieve object location and classification with only a few training samples. Currently, mainstream transfer-learning methods employ a two-stage approach: pretraining on data-abundant base classes and fine-tuning on few-shot novel classes. However, existing approaches suffer notable degradation in both base and novel classes during fine-tuning, because of gradient conflict and class imbalance. To address this, we construct the balanced orthogonal subspace separation (BOSS) detector, a novel two-stage framework for FSOD. Specifically, to avoid contradictory gradients, BOSS distinctly isolates the training of base and novel classes at both structural and feature levels. For structural separation, a low-rank subspace adapter (LoSA) is introduced to ensure network optimization for novel classes without hampering base classes' pretraining performance, effectively addressing over-fitting in few-shot scenarios. For feature disentanglement, an orthogonal subspace extractor (OSE) is presented, enhancing class separability by learning class-specific, orthogonal basis-spanned subspace. Finally, a balanced classifier (BC) is proposed to equalize the imbalanced loss, with its dual-component design mitigating bias toward predicting background or base classes. Comparative evaluations on diverse remote sensing datasets demonstrate BOSS's superiority, outperforming state-of-the-art methods in mean average precision (mAP). These results underscore BOSS's effectiveness in FSOD, particularly in challenging remote sensing contexts.
引用
收藏
页数:17
相关论文
共 79 条
  • [1] Aghajanyan A., 2021, P 59 ANN M ASS COMPU, V1, P7319
  • [2] Anh-Huy Phan, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12374), P522, DOI 10.1007/978-3-030-58526-6_31
  • [3] Baur Christoph, 2017, Medical Image Computing and Computer Assisted Intervention MICCAI 2017. 20th International Conference. Proceedings: LNCS 10435, P311, DOI 10.1007/978-3-319-66179-7_36
  • [4] Cascade R-CNN: Delving into High Quality Object Detection
    Cai, Zhaowei
    Vasconcelos, Nuno
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
  • [5] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [6] Chen H, 2018, AAAI CONF ARTIF INTE, P2836
  • [7] Chen SF, 2022, ADV NEUR IN
  • [8] Chen Z., 2023, PROC INT C LEARN REP, P1
  • [9] Prototype-CNN for Few-Shot Object Detection in Remote Sensing Images
    Cheng, Gong
    Yan, Bowei
    Shi, Peizhen
    Li, Ke
    Yao, Xiwen
    Guo, Lei
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [10] Multi-class geospatial object detection and geographic image classification based on collection of part detectors
    Cheng, Gong
    Han, Junwei
    Zhou, Peicheng
    Guo, Lei
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 98 : 119 - 132