SAPENet: Self-Attention based Prototype Enhancement Network for Few-shot Learning

被引:35
|
作者
Huang, Xilang [1 ]
Choi, Seon Han [2 ,3 ]
机构
[1] Pukyong Natl Univ, Dept Artificial Intelligent Convergence, Pusan 48513, South Korea
[2] Ewha Womans Univ, Dept Elect & Elect Engn, Seoul 03760, South Korea
[3] Ewha Womans Univ, Grad Program Smart Factory, Seoul 03760, South Korea
基金
新加坡国家研究基金会;
关键词
Few -shot learning; Multi -head self -attention mechanism; Image classification; k -Nearest neighbor;
D O I
10.1016/j.patcog.2022.109170
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot learning considers the problem of learning unseen categories given only a few labeled samples. As one of the most popular few-shot learning approaches, Prototypical Networks have received considerable attention owing to their simplicity and efficiency. However, a class prototype is typically obtained by averaging a few labeled samples belonging to the same class, which treats the samples as equally important and is thus prone to learning redundant features. Herein, we propose a self-attention based prototype enhancement network (SAPENet) to obtain a more representative prototype for each class. SAPENet utilizes multi-head self-attention mechanisms to selectively augment discriminative features in each sample feature map, and generates channel attention maps between intra-class sample features to attentively retain informative channel features for that class. The augmented feature maps and attention maps are finally fused to obtain representative class prototypes. Thereafter, a local descriptor-based metric module is employed to fully exploit the channel information of the prototypes by searching k similar local descriptors of the prototype for each local descriptor in the unlabeled samples for classification. We performed experiments on multiple benchmark datasets: miniImageNet, tieredImageNet, and CUB-200-2011. The experimental results on these datasets show that SAPENet achieves a considerable improvement compared to Prototypical Networks and also outperforms related state-of-the-art methods.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Holistic Prototype Attention Network for Few-Shot Video Object Segmentation
    Tang, Yin
    Chen, Tao
    Jiang, Xiruo
    Yao, Yazhou
    Xie, Guo-Sen
    Shen, Heng-Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 6699 - 6709
  • [22] Adaptive Meta Transfer Learning with Efficient Self-Attention for Few-Shot Bearing Fault Diagnosis
    Zhao, Jun
    Tang, Tang
    Yu, Ying
    Wang, Jingwei
    Yang, Tianyuan
    Chen, Ming
    Wu, Jie
    NEURAL PROCESSING LETTERS, 2023, 55 (02) : 949 - 968
  • [23] Adaptive Meta Transfer Learning with Efficient Self-Attention for Few-Shot Bearing Fault Diagnosis
    Jun Zhao
    Tang Tang
    Ying Yu
    Jingwei Wang
    Tianyuan Yang
    Ming Chen
    Jie Wu
    Neural Processing Letters, 2023, 55 : 949 - 968
  • [24] Contrastive prototype loss based discriminative feature network for few-shot learning
    Yan, Leilei
    He, Feihong
    Zheng, Xiaohan
    Zhang, Li
    Zhang, Yiqi
    He, Jiangzhen
    Du, Weidong
    Wang, Yansong
    Li, Fanzhang
    APPLIED INTELLIGENCE, 2025, 55 (05)
  • [25] Mutual Learning Prototype Network for Few-Shot Text Classification
    Liu, Jun
    Qin, Xiaorui
    Tao, Jian
    Dong, Hongfei
    Li, Xiaoxu
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2024, 47 (03): : 30 - 35
  • [26] SEGA: Semantic Guided Attention on Visual Prototype for Few-Shot Learning
    Yang, Fengyuan
    Wang, Ruiping
    Chen, Xilin
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1586 - 1596
  • [27] Complementary features based prototype self-updating for few-shot learning
    Xu, Xinlei
    Wang, Zhe
    Chi, Ziqiu
    Yang, Hai
    Du, Wenli
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 214
  • [28] Attention Based Siamese Networks for Few-Shot Learning
    Wang, Junhua
    Zhu, Zijiang
    Li, Jianjun
    Li, Junshan
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 551 - 554
  • [29] Multi-instance attention network for few-shot learning
    Qin, Zhili
    Wang, Han
    Mawuli, Cobbinah Bernard
    Han, Wei
    Zhang, Rui
    Yang, Qinli
    Shao, Junming
    INFORMATION SCIENCES, 2022, 611 : 464 - 475
  • [30] Multi-instance attention network for few-shot learning
    Qin, Zhili
    Wang, Han
    Mawuli, Cobbinah Bernard
    Han, Wei
    Zhang, Rui
    Yang, Qinli
    Shao, Junming
    Information Sciences, 2022, 611 : 464 - 475