Dynamic Latent Feature Guidance for Few-Shot Object Detection

被引:0
作者
Yao, Xinwei [1 ]
Liu, Jun [1 ]
Li, Qiang [1 ]
Zhang, Hengcong [1 ]
Tu, Zitao [1 ]
机构
[1] Zhejiang Univ Technol, Sch Comp Sci & Technol, Hangzhou 310014, Peoples R China
关键词
Feature extraction; Object detection; Image reconstruction; Detectors; Training; Accuracy; Metalearning; Few shot learning; Transformers; Prototypes; Few-shot object detection (FSOD); meta-learning; multiscale context; variational autoencoder (VAE);
D O I
10.1109/TII.2025.3575898
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Few-shot object detection usually faces the challenge of imbalanced data distribution. The limited training data for novel classes not only leads to insufficient representation of support features but also biases the detector toward base classes. To address these problems, we propose a novel dynamic latent feature guidance method. First, the latent feature reconstruction module utilizes a variational autoencoder to reconstruct query and support features, extracting additional information representations from the latent space, thereby enriching feature representation and compensating for the information deficiencies caused by limited samples. Second, we design the dynamic multiscale similarity guidance module, which highlights information relevant to query images and suppresses background noise and occlusion interference through global, regional, and local similarities. Extensive experimental results demonstrate that our proposed method significantly improves detection accuracy on the PASCAL VOC and MS COCO datasets, outperforming existing state-of-the-art methods.
引用
收藏
页数:9
相关论文
共 38 条
[21]   Transferrable Prototypical Networks for Unsupervised Domain Adaptation [J].
Pan, Yingwei ;
Yao, Ting ;
Li, Yehao ;
Wang, Yu ;
Ngo, Chong-Wah ;
Mei, Tao .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2234-2242
[22]   DeFRCN: Decoupled Faster R-CNN for Few-Shot Object Detection [J].
Qiao, Limeng ;
Zhao, Yuxuan ;
Li, Zhiyuan ;
Qiu, Xi ;
Wu, Jianan ;
Zhang, Chi .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :8661-8670
[23]   You Only Look Once: Unified, Real-Time Object Detection [J].
Redmon, Joseph ;
Divvala, Santosh ;
Girshick, Ross ;
Farhadi, Ali .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788
[24]   Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].
Ren, Shaoqing ;
He, Kaiming ;
Girshick, Ross ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149
[25]  
Santoro A, 2016, PR MACH LEARN RES, V48
[26]  
Snell J, 2017, ADV NEUR IN, V30
[27]   FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding [J].
Sun, Bo ;
Li, Banghuai ;
Cai, Shengcai ;
Yuan, Ye ;
Zhang, Chi .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :7348-7358
[28]  
Wang X., 2020, INT C MACH LEARN
[29]   Meta-Learning to Detect Rare Objects [J].
Wang, Yu-Xiong ;
Ramanan, Deva ;
Hebert, Martial .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9924-9933
[30]   Multi-scale Positive Sample Refinement for Few-Shot Object Detection [J].
Wu, Jiaxi ;
Liu, Songtao ;
Huang, Di ;
Wang, Yunhong .
COMPUTER VISION - ECCV 2020, PT XVI, 2020, 12361 :456-472