GFENet: Generalization Feature Extraction Network for Few-Shot Object Detection

被引:0
作者
Ke, Xiao [1 ]
Chen, Qiuqin [1 ]
Liu, Hao [1 ]
Guo, Wenzhong [1 ]
机构
[1] Fuzhou Univ, Coll Comp & Data Sci, Fujian Prov Key Lab Networking Comp & Intelligent, Fuzhou 350116, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Data models; Object detection; Training; Adaptation models; Computational modeling; Shape; Transfer learning; few-shot learning; object detection; data augmentation; self-distillation;
D O I
10.1109/TCSVT.2024.3435977
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Few-shot object detection achieves rapid detection of novel-class objects by training detectors with a minimal number of novel-class annotated instances. Transfer learning-based few-shot object detection methods have shown better performance compared to other methods such as meta-learning. However, when training with base-class data, the model may gradually bias towards learning the characteristics of each category in the base-class data, which could result in a decrease in learning ability during fine-tuning on novel classes, and further overfitting due to data scarcity. In this paper, we first find that the generalization performance of the base-class model has a significant impact on novel class detection performance and proposes a generalization feature extraction network framework to address this issue. This framework perturbs the base model during training to encourage it to learn generalization features and solves the impact of changes in object shape and size on overall detection performance, improving the generalization performance of the base model. Additionally, we propose a feature-level data augmentation method based on self-distillation to further enhance the overall generalization ability of the model. Our method achieves state-of-the-art results on both the COCO and PASCAL VOC datasets, with a 6.94% improvement on the PASCAL VOC 10-shot dataset.
引用
收藏
页码:12741 / 12755
页数:15
相关论文
共 76 条
[1]   Few-Shot Object Detection: A Survey [J].
Antonelli, Simone ;
Avola, Danilo ;
Cinque, Luigi ;
Crisostomi, Donato ;
Foresti, Gian Luca ;
Galasso, Fabio ;
Marini, Marco Raoul ;
Mecca, Alessio ;
Pannone, Daniele .
ACM COMPUTING SURVEYS, 2022, 54 (11S)
[2]   Class Incremental Learning With Few-Shots Based on Linear Programming for Hyperspectral Image Classification [J].
Bai, Jing ;
Yuan, Anran ;
Xiao, Zhu ;
Zhou, Huaji ;
Wang, Dingchen ;
Jiang, Hongbo ;
Jiao, Licheng .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (06) :5474-5485
[3]   Anomaly Detection in Autonomous Driving: A Survey [J].
Bogdoll, Daniel ;
Nitsche, Maximilian ;
Zoellner, J. Marius .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, :4487-4498
[4]  
Cao YH, 2021, ADV NEUR IN, V34
[5]   SD-FSOD: Self-Distillation Paradigm via Distribution Calibration for Few-Shot Object Detection [J].
Chen, Han ;
Wang, Qi ;
Xie, Kailin ;
Lei, Liang ;
Lin, Matthieu Gaetan ;
Lv, Tian ;
Liu, Yongjin ;
Luo, Jiebo .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) :5963-5976
[6]   Towards Accurate One-Stage Object Detection with AP-Loss [J].
Chen, Kean ;
Li, Jianguo ;
Lin, Weiyao ;
See, John ;
Wang, Ji ;
Duan, Lingyu ;
Chen, Zhibo ;
He, Changwei ;
Zou, Junni .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5114-5122
[7]  
Chen Q., 2022, arXiv, DOI 10.48550/arXiv.2203.02270
[8]   Domain Adaptive Faster R-CNN for Object Detection in the Wild [J].
Chen, Yuhua ;
Li, Wen ;
Sakaridis, Christos ;
Dai, Dengxin ;
Van Gool, Luc .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3339-3348
[9]   Hybrid routing transformer for zero-shot learning [J].
Cheng, De ;
Wang, Gerong ;
Wang, Bo ;
Zhang, Qiang ;
Han, Jungong ;
Zhang, Dingwen .
PATTERN RECOGNITION, 2023, 137
[10]   Meta-Learning-Based Incremental Few-Shot Object Detection [J].
Cheng, Meng ;
Wang, Hanli ;
Long, Yu .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) :2158-2169