Learn to aggregate global and local representations for few-shot learning

被引：0

作者：

Mounir Abdelaziz

Zuping Zhang

机构：

[1] Central South University,School of Computer Science & Engineering

来源：

Multimedia Tools and Applications | 2023年 / 82卷

关键词：

Few-shot learning; Metric learning; Deep nearest neighbors; Class prototypes; Euclidean distance; Cosine similarity;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Few-shot learning aims to train recognition models to learn new object categories from limited training examples. Recent metric-learning based methods have made significant progress. Most of these methods rely on a single similarity metric at a global or local level. However, classifying samples using multiple similarity metrics at different levels simultaneously can produce a better similarity measure and more discriminative feature maps. Therefore, in this paper, a novel method called Learn to Aggregate Global and Local Representations for Few-shot Learning is introduced. Our proposed method embeds the support images and the query images. Then, it calculates four distinct similarity metrics between representations at global and local levels. Finally, the calculated similarities are combined and fed to a fusion module to obtain a final similarity score. Extensive experiments demonstrate that our method achieves state-of-the-art results on popular benchmarks. Particularly, AGLRs outperforms DN4 with a margin of ≈ 3 − 4% on the miniImageNet dataset.

引用

页码：32991 / 33014

页数：23

共 17 条

[1]

Abdelaziz M(2021)Few-shot learning with saliency maps as additional visual information Multimed Tools Appl 80 10491-10508

[2]

Zhang Z(1987)Recognition-by-components: a theory of human image understanding Psychol Rev 94 115-147

[3]

Biederman I(2019)Multi-Level Semantic feature augmentation for One-Shot learning IEEE Trans Image Process 28 4594-4605

[4]

Chen Z(2006)One-shot learning of object categories IEEE Trans Pattern Anal Mach Intell 28 594-611

[5]

Fu Y(2017)Imagenet classification with deep convolutional neural networks Communications of The ACM 60 84-90

[6]

Zhang Y(2015)Imagenet large scale visual recognition challenge Int J Comput Vis 115 211-252

[7]

Fei-Fei L(2002)A perspective view and survey of meta-learning Artif Intell Rev 18 77-95

[8]

Fergus R(undefined)undefined undefined undefined undefined-undefined

[9]

Perona P(undefined)undefined undefined undefined undefined-undefined

[10]

Krizhevsky A(undefined)undefined undefined undefined undefined-undefined

← 1 2 →