GFENet: Generalization Feature Extraction Network for Few-Shot Object Detection

Cited by: 0
Authors
Ke, Xiao [1]
Chen, Qiuqin [1]
Liu, Hao [1]
Guo, Wenzhong [1]
Affiliations
[1] Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Feature extraction; Data models; Object detection; Training; Adaptation models; Computational modeling; Shape; Transfer learning; few-shot learning; object detection; data augmentation; self-distillation;
DOI
10.1109/TCSVT.2024.3435977
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
Few-shot object detection achieves rapid detection of novel-class objects by training detectors with a minimal number of annotated novel-class instances. Transfer learning-based few-shot object detection methods have shown better performance than other approaches such as meta-learning. However, when trained on base-class data, the model may gradually become biased toward the characteristics of each base-class category, which can reduce its learning ability during fine-tuning on novel classes and lead to further overfitting due to data scarcity. In this paper, we first find that the generalization performance of the base-class model has a significant impact on novel-class detection performance, and we propose a generalization feature extraction network framework to address this issue. This framework perturbs the base model during training to encourage it to learn generalization features and addresses the impact of changes in object shape and size on overall detection performance, improving the generalization performance of the base model. Additionally, we propose a feature-level data augmentation method based on self-distillation to further enhance the overall generalization ability of the model. Our method achieves state-of-the-art results on both the COCO and PASCAL VOC datasets, with a 6.94% improvement on the PASCAL VOC 10-shot dataset.
Pages: 12741-12755
Number of pages: 15