Balanced Orthogonal Subspace Separation Detector for Few-Shot Object Detection in Aerial Imagery

被引：2

作者：

Jiang, Hongxiang ^{[1
]}

Wang, Qixiong ^{[1
]}

Feng, Jiaqi ^{[1
]}

Zhang, Guangyun ^{[2
]}

Yin, Jihao ^{[1
]}

机构：

[1] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China

[2] Nanjing Tech Univ, Ctr Remote Sensing, Nanjing 300072, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

关键词：

Remote sensing; Detectors; Object detection; Training; Feature extraction; Metalearning; Task analysis; Adapter tuning; disentanglement representation; few-shot object detection (FSOD); orthogonal subspace learning; remote sensing images (RSIs);

D O I：

10.1109/TGRS.2024.3423305

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Few-shot object detection (FSOD) in remote sensing images (RSIs) aims to achieve object location and classification with only a few training samples. Currently, mainstream transfer-learning methods employ a two-stage approach: pretraining on data-abundant base classes and fine-tuning on few-shot novel classes. However, existing approaches suffer notable degradation in both base and novel classes during fine-tuning, because of gradient conflict and class imbalance. To address this, we construct the balanced orthogonal subspace separation (BOSS) detector, a novel two-stage framework for FSOD. Specifically, to avoid contradictory gradients, BOSS distinctly isolates the training of base and novel classes at both structural and feature levels. For structural separation, a low-rank subspace adapter (LoSA) is introduced to ensure network optimization for novel classes without hampering base classes' pretraining performance, effectively addressing over-fitting in few-shot scenarios. For feature disentanglement, an orthogonal subspace extractor (OSE) is presented, enhancing class separability by learning class-specific, orthogonal basis-spanned subspace. Finally, a balanced classifier (BC) is proposed to equalize the imbalanced loss, with its dual-component design mitigating bias toward predicting background or base classes. Comparative evaluations on diverse remote sensing datasets demonstrate BOSS's superiority, outperforming state-of-the-art methods in mean average precision (mAP). These results underscore BOSS's effectiveness in FSOD, particularly in challenging remote sensing contexts.

引用

页数：17

共 79 条

[61] GCWNet: A Global Context-Weaving Network for Object Detection in Remote Sensing Images
Wu, Yulin
Zhang, Ke
Wang, Jingyu
Wang, Yezi
Wang, Qi
Li, Xuelong
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[62] Oriented R-CNN for Object Detection
Xie, Xingxing
Cheng, Gong
Wang, Jiabao
Yao, Xiwen
Han, Junwei
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3500 - 3509
[63] Xu YC, 2021, IEEE T PATTERN ANAL, V43, P1452, DOI [10.1109/TPAMI.2020.2974745, 10.1109/TGRS.2020.3026387]
[64] Meta R-CNN : Towards General Solver for Instance-level Low-shot Learning
Yan, Xiaopeng
Chen, Ziliang
Xu, Anni
Wang, Xiaoxi
Liang, Xiaodan
Lin, Liang
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9576 - 9585
[65] Orthogonality Loss: Learning Discriminative Representations for Face Recognition
Yang, Shanming
Deng, Weihong
Wang, Mei
Du, Junping
Hu, Jiani
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2301 - 2314
[66] Yang X., 2021, P AAAI C ARTIFICIAL, P3163, DOI 10.1609/aaai.v35i4.16426
[67] Yang X, 2021, PR MACH LEARN RES, V139
[68] Yang Xue, 2021, Advances in Neural Information Processing Systems, V34
[69] RepPoints: Point Set Representation for Object Detection
Yang, Ze
Liu, Shaohui
Hu, Han
Wang, Liwei
Lin, Stephen
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9656 - 9665
[70] Yeh SY, 2024, Arxiv, DOI arXiv:2309.14859

← 1 2 3 4 5 6 7 8 →