Multi-scale task-aware structure graph modeling for few-shot image recognition

Citations: 0
Authors
Zhao, Peng [1]
Ye, Zilong [1, 2]
Wang, Liang [1, 2]
Liu, Huiting [1, 2]
Ji, Xia [1, 2]
Affiliations
[1] Anhui Univ, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230601, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Few-shot learning; Multi-scale representation; Task-aware; Graph attention network
DOI
10.1016/j.patcog.2024.110855
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Few-shot image recognition, a few-shot learning (FSL) task, aims to recognize images from novel classes given only a limited number of labeled training images. FSL is challenging for two reasons. First, limited labeled training samples cannot adequately represent the class distributions. Second, the base classes used in training and the novel classes used in testing do not intersect and have different distributions, which leads to a domain shift problem when generalizing the learned model to the novel-class dataset. In this paper, we propose multi-scale task-aware structure graph modeling (MTSGM) for few-shot image recognition. We train a meta-filter learner to generate task-aware local structure filters for each scale, which adaptively capture the local structures at that scale. Moreover, we introduce a novel multi-scale graph attention network (MGAT) module to model the multi-scale local structures of an image, exploring the correlations between different local structures across all scales of the image. Finally, we leverage the attention mechanism of the graph attention network to aggregate and propagate information, aiming to obtain more representative and discriminative local structure features that integrate both local and global information. We conducted comprehensive experiments on four benchmark datasets widely adopted in FSL tasks. The experimental results demonstrate that MTSGM achieves state-of-the-art performance, which validates its effectiveness.
Pages: 13
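
The abstract describes aggregating information across local structures from several scales with a graph attention mechanism. The record does not give the MGAT formulation, so the following is only a generic, single-head graph-attention sketch in plain Python, not the authors' module. It assumes a fully connected graph, omits the learned linear transform, and uses hypothetical names (`graph_attention_layer`, `attn_vec`) and made-up toy feature values.

```python
import math


def leaky_relu(x, slope=0.2):
    """LeakyReLU as commonly used in graph attention scoring."""
    return x if x > 0 else slope * x


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def graph_attention_layer(nodes, attn_vec):
    """One attention pass over a fully connected graph (illustrative only).

    nodes:    list of feature vectors, each of length d
    attn_vec: hypothetical attention parameters of length 2 * d
    Returns one aggregated feature vector per node.
    """
    updated = []
    for h_i in nodes:
        # Score every neighbor j from the concatenation [h_i || h_j].
        scores = [
            leaky_relu(sum(a * x for a, x in zip(attn_vec, h_i + h_j)))
            for h_j in nodes
        ]
        weights = softmax(scores)
        # Attention-weighted sum of neighbor features.
        agg = [
            sum(w * h_j[k] for w, h_j in zip(weights, nodes))
            for k in range(len(h_i))
        ]
        updated.append(agg)
    return updated


# Toy local-structure features (assumed values): two nodes from a fine
# scale and one from a coarse scale, placed in a single graph.
fine_scale = [[1.0, 0.0], [0.0, 1.0]]
coarse_scale = [[0.5, 0.5]]
nodes = fine_scale + coarse_scale
attn_vec = [0.1, -0.2, 0.3, 0.4]  # arbitrary, not learned

updated = graph_attention_layer(nodes, attn_vec)
uniform = graph_attention_layer(nodes, [0.0] * 4)
```

Because the attention weights for each node sum to one, every output is a convex combination of the input features. With an all-zero `attn_vec` the weights are uniform and each node simply receives the mean of all nodes.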