Learning Primitive-Aware Discriminative Representations for Few-Shot Learning

被引：0

作者：

Yang, Jianpeng ^{[1
]}

Niu, Yuhang ^{[1
]}

Xie, Xuemei ^{[1
,2
]}

Shi, Guangming ^{[1
]}

机构：

[1] Xidian Univ, Sch Artificial Intelligence, Xian, Peoples R China

[2] Xidian Univ, Guangzhou Inst Technol, Xian, Peoples R China

来源：

NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II | 2024年 / 14448卷

关键词：

Few-shot Learning; Visual Primitive; Graph Convolution; Episodic Training;

D O I：

10.1007/978-981-99-8082-6_11

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Few-shot Learning (FSL) aims to learn a classifier that can be easily adapted to recognize novel classes with only a few labeled examples. Recently, some works about FSL have yielded promising classification performance, where the image-level feature is used to calculate the similarity among samples for classification. However, the image-level feature ignores abundant fine-grained and structural information of objects that could be transferable and consistent between seen and unseen classes. How can humans easily identify novel classes with several samples? Some studies from cognitive science argue that humans recognize novel categories based on primitives. Although base and novel categories are non-overlapping, they share some primitives in common. Inspired by above research, we propose a Primitive Mining andReasoning Network (PMRN) to learn primitive-aware representations based onmetric-based FSL model. Concretely, we first add Self-supervision Jigsaw task (SSJ) for feature extractor parallelly, guiding the model encoding visual pattern corresponding to object parts into feature channels. Moreover, to mine discriminative representations, an Adaptive Channel Grouping (ACG) method is applied to cluster and weight spatially and semantically related visual patterns to generate a set of visual primitives. To further enhance the discriminability and transferability of primitives, we propose a visual primitive Correlation Reasoning Network (CRN) based on Graph Convolutional network to learn abundant structural information and internal correlation among primitives. Finally, a primitive-level metric is conducted for classification in a meta-task based on episodic training strategy. Extensive experiments show that our method achieves state-of-the-art results on miniImageNet and Caltech-UCSD Birds.

引用

页码：131 / 146

页数：16

共 30 条

[1] Antoniou A, 2019, ADV NEUR IN, V32
[2] Ashok A., 2021, ML Reproducibility Challenge
[3] Easy-Ensemble Augmented-Shot-Y-Shaped Learning: State-of-the-Art Few-Shot Classification with Simple Components
Bendou, Yassir
Hu, Yuqing
Lafargue, Raphael
Lioi, Giulia
Pasdeloup, Bastien
Pateux, Stephane
Gripon, Vincent
[J]. JOURNAL OF IMAGING, 2022, 8 (07)
[4] Dong CAQ, 2020, PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P716
[5] Finn C, 2017, PR MACH LEARN RES, V70
[6] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[7] Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot Learning
He, Yangji
Liang, Weihan
Zhao, Dongyang
Zhou, Hong-Yu
Ge, Weifeng
Yu, Yizhou
Zhang, Wenqiang
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9109 - 9119
[8] Hou RB, 2019, ADV NEUR IN, V32
[9] Hu P, 2019, Arxiv, DOI arXiv:1906.04833
[10] King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001

← 1 2 3 →