Learning Primitive-Aware Discriminative Representations for Few-Shot Learning

被引:0
作者
Yang, Jianpeng [1 ]
Niu, Yuhang [1 ]
Xie, Xuemei [1 ,2 ]
Shi, Guangming [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian, Peoples R China
[2] Xidian Univ, Guangzhou Inst Technol, Xian, Peoples R China
来源
NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II | 2024年 / 14448卷
关键词
Few-shot Learning; Visual Primitive; Graph Convolution; Episodic Training;
D O I
10.1007/978-981-99-8082-6_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot Learning (FSL) aims to learn a classifier that can be easily adapted to recognize novel classes with only a few labeled examples. Recently, some works about FSL have yielded promising classification performance, where the image-level feature is used to calculate the similarity among samples for classification. However, the image-level feature ignores abundant fine-grained and structural information of objects that could be transferable and consistent between seen and unseen classes. How can humans easily identify novel classes with several samples? Some studies from cognitive science argue that humans recognize novel categories based on primitives. Although base and novel categories are non-overlapping, they share some primitives in common. Inspired by above research, we propose a Primitive Mining andReasoning Network (PMRN) to learn primitive-aware representations based onmetric-based FSL model. Concretely, we first add Self-supervision Jigsaw task (SSJ) for feature extractor parallelly, guiding the model encoding visual pattern corresponding to object parts into feature channels. Moreover, to mine discriminative representations, an Adaptive Channel Grouping (ACG) method is applied to cluster and weight spatially and semantically related visual patterns to generate a set of visual primitives. To further enhance the discriminability and transferability of primitives, we propose a visual primitive Correlation Reasoning Network (CRN) based on Graph Convolutional network to learn abundant structural information and internal correlation among primitives. Finally, a primitive-level metric is conducted for classification in a meta-task based on episodic training strategy. Extensive experiments show that our method achieves state-of-the-art results on miniImageNet and Caltech-UCSD Birds.
引用
收藏
页码:131 / 146
页数:16
相关论文
共 30 条
  • [1] Antoniou A, 2019, ADV NEUR IN, V32
  • [2] Ashok A., 2021, ML Reproducibility Challenge
  • [3] Easy-Ensemble Augmented-Shot-Y-Shaped Learning: State-of-the-Art Few-Shot Classification with Simple Components
    Bendou, Yassir
    Hu, Yuqing
    Lafargue, Raphael
    Lioi, Giulia
    Pasdeloup, Bastien
    Pateux, Stephane
    Gripon, Vincent
    [J]. JOURNAL OF IMAGING, 2022, 8 (07)
  • [4] Dong CAQ, 2020, PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P716
  • [5] Finn C, 2017, PR MACH LEARN RES, V70
  • [6] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [7] Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot Learning
    He, Yangji
    Liang, Weihan
    Zhao, Dongyang
    Zhou, Hong-Yu
    Ge, Weifeng
    Yu, Yizhou
    Zhang, Wenqiang
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9109 - 9119
  • [8] Hou RB, 2019, ADV NEUR IN, V32
  • [9] Hu P, 2019, Arxiv, DOI arXiv:1906.04833
  • [10] King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001