GFENet: Generalization Feature Extraction Network for Few-Shot Object Detection

被引:0
作者
Ke, Xiao [1 ]
Chen, Qiuqin [1 ]
Liu, Hao [1 ]
Guo, Wenzhong [1 ]
机构
[1] Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Data models; Object detection; Training; Adaptation models; Computational modeling; Shape; Transfer learning; few-shot learning; object detection; data augmentation; self-distillation;
D O I
10.1109/TCSVT.2024.3435977
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Few-shot object detection achieves rapid detection of novel-class objects by training detectors with a minimal number of novel-class annotated instances. Transfer learning-based few-shot object detection methods have shown better performance compared to other methods such as meta-learning. However, when training with base-class data, the model may gradually bias towards learning the characteristics of each category in the base-class data, which could result in a decrease in learning ability during fine-tuning on novel classes, and further overfitting due to data scarcity. In this paper, we first find that the generalization performance of the base-class model has a significant impact on novel class detection performance and proposes a generalization feature extraction network framework to address this issue. This framework perturbs the base model during training to encourage it to learn generalization features and solves the impact of changes in object shape and size on overall detection performance, improving the generalization performance of the base model. Additionally, we propose a feature-level data augmentation method based on self-distillation to further enhance the overall generalization ability of the model. Our method achieves state-of-the-art results on both the COCO and PASCAL VOC datasets, with a 6.94% improvement on the PASCAL VOC 10-shot dataset.
引用
收藏
页码:12741 / 12755
页数:15
相关论文
共 76 条
  • [1] Few-Shot Object Detection: A Survey
    Antonelli, Simone
    Avola, Danilo
    Cinque, Luigi
    Crisostomi, Donato
    Foresti, Gian Luca
    Galasso, Fabio
    Marini, Marco Raoul
    Mecca, Alessio
    Pannone, Daniele
    [J]. ACM COMPUTING SURVEYS, 2022, 54 (11S)
  • [2] Class Incremental Learning With Few-Shots Based on Linear Programming for Hyperspectral Image Classification
    Bai, Jing
    Yuan, Anran
    Xiao, Zhu
    Zhou, Huaji
    Wang, Dingchen
    Jiang, Hongbo
    Jiao, Licheng
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (06) : 5474 - 5485
  • [3] Anomaly Detection in Autonomous Driving: A Survey
    Bogdoll, Daniel
    Nitsche, Maximilian
    Zoellner, J. Marius
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4487 - 4498
  • [4] Cao YH, 2021, ADV NEUR IN, V34
  • [5] SD-FSOD: Self-Distillation Paradigm via Distribution Calibration for Few-Shot Object Detection
    Chen, Han
    Wang, Qi
    Xie, Kailin
    Lei, Liang
    Lin, Matthieu Gaetan
    Lv, Tian
    Liu, Yongjin
    Luo, Jiebo
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5963 - 5976
  • [6] Towards Accurate One-Stage Object Detection with AP-Loss
    Chen, Kean
    Li, Jianguo
    Lin, Weiyao
    See, John
    Wang, Ji
    Duan, Lingyu
    Chen, Zhibo
    He, Changwei
    Zou, Junni
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5114 - 5122
  • [7] Chen Q., 2022, arXiv, DOI 10.48550/arXiv.2203.02270
  • [8] Domain Adaptive Faster R-CNN for Object Detection in the Wild
    Chen, Yuhua
    Li, Wen
    Sakaridis, Christos
    Dai, Dengxin
    Van Gool, Luc
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3339 - 3348
  • [9] Hybrid routing transformer for zero-shot learning
    Cheng, De
    Wang, Gerong
    Wang, Bo
    Zhang, Qiang
    Han, Jungong
    Zhang, Dingwen
    [J]. PATTERN RECOGNITION, 2023, 137
  • [10] Meta-Learning-Based Incremental Few-Shot Object Detection
    Cheng, Meng
    Wang, Hanli
    Long, Yu
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2158 - 2169