Balanced Orthogonal Subspace Separation Detector for Few-Shot Object Detection in Aerial Imagery

被引:2
|
作者
Jiang, Hongxiang [1 ]
Wang, Qixiong [1 ]
Feng, Jiaqi [1 ]
Zhang, Guangyun [2 ]
Yin, Jihao [1 ]
机构
[1] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China
[2] Nanjing Tech Univ, Ctr Remote Sensing, Nanjing 300072, Peoples R China
关键词
Remote sensing; Detectors; Object detection; Training; Feature extraction; Metalearning; Task analysis; Adapter tuning; disentanglement representation; few-shot object detection (FSOD); orthogonal subspace learning; remote sensing images (RSIs);
D O I
10.1109/TGRS.2024.3423305
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Few-shot object detection (FSOD) in remote sensing images (RSIs) aims to achieve object location and classification with only a few training samples. Currently, mainstream transfer-learning methods employ a two-stage approach: pretraining on data-abundant base classes and fine-tuning on few-shot novel classes. However, existing approaches suffer notable degradation in both base and novel classes during fine-tuning, because of gradient conflict and class imbalance. To address this, we construct the balanced orthogonal subspace separation (BOSS) detector, a novel two-stage framework for FSOD. Specifically, to avoid contradictory gradients, BOSS distinctly isolates the training of base and novel classes at both structural and feature levels. For structural separation, a low-rank subspace adapter (LoSA) is introduced to ensure network optimization for novel classes without hampering base classes' pretraining performance, effectively addressing over-fitting in few-shot scenarios. For feature disentanglement, an orthogonal subspace extractor (OSE) is presented, enhancing class separability by learning class-specific, orthogonal basis-spanned subspace. Finally, a balanced classifier (BC) is proposed to equalize the imbalanced loss, with its dual-component design mitigating bias toward predicting background or base classes. Comparative evaluations on diverse remote sensing datasets demonstrate BOSS's superiority, outperforming state-of-the-art methods in mean average precision (mAP). These results underscore BOSS's effectiveness in FSOD, particularly in challenging remote sensing contexts.
引用
收藏
页码:1 / 1
页数:17
相关论文
共 50 条
  • [21] Few-Shot Object Detection with Model Calibration
    Fan, Qi
    Tang, Chi-Keung
    Tai, Yu-Wing
    COMPUTER VISION, ECCV 2022, PT XIX, 2022, 13679 : 720 - 739
  • [22] A Closer Look at Few-Shot Object Detection
    Liu, Yuhao
    Dong, Le
    He, Tengyang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 430 - 447
  • [23] Few-Shot Object Detection: A Comprehensive Survey
    Koehler, Mona
    Eisenbach, Markus
    Gross, Horst-Michael
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 11958 - 11978
  • [24] Industrial few-shot fractal object detection
    Haoran Huang
    Xiaochuan Luo
    Chen Yang
    Neural Computing and Applications, 2023, 35 : 21055 - 21069
  • [25] Transformation Invariant Few-Shot Object Detection
    Li, Aoxue
    Li, Zhenguo
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3093 - 3101
  • [26] Few-Shot Object Detection with Weight Imprinting
    Dingtian Yan
    Jitao Huang
    Hai Sun
    Fuqiang Ding
    Cognitive Computation, 2023, 15 : 1725 - 1735
  • [27] Few-Shot Object Detection with Foundation Models
    Han, Guangxing
    Lim, Ser-Nam
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 28608 - 28618
  • [28] Few-Shot Object Detection in Unseen Domains
    Guirguis, Karim
    Eskandar, George
    Kayser, Matthias
    Yang, Bin
    Beyerer, Juergen
    2022 16TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS, SITIS, 2022, : 98 - 107
  • [29] Improving Few-Shot Object Detection through a Performance Analysis on Aerial and Natural Images
    Le Jeune, Pierre
    Mokraoui, Anissa
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 513 - 517
  • [30] Experience feedback using Representation Learning for Few-Shot Object Detection on Aerial Images
    Le Jeune, Pierre
    Lebbah, Mustapha
    Mokraoui, Anissa
    Azzag, Hanene
    20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 662 - 667