Multi-scale task-aware structure graph modeling for few-shot image recognition

Citations: 0

Authors:

Zhao, Peng [1]

Ye, Zilong [1,2]

Wang, Liang [1,2]

Liu, Huiting [1,2]

Ji, Xia [1,2]

Affiliations:

[1] Anhui Univ, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230601, Peoples R China

[2] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China

Source:

PATTERN RECOGNITION | 2024 / Vol. 156

Funding:

National Natural Science Foundation of China;

Keywords:

Few-shot learning; Multi-scale representation; Task-aware; Graph attention network;

DOI:

10.1016/j.patcog.2024.110855

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Few-shot image recognition, a few-shot learning (FSL) task, attempts to recognize images from novel classes given only a limited number of labeled training images. FSL is very challenging: a limited number of labeled training samples cannot adequately represent the class distribution, and the base classes used in training and the novel classes used in testing do not intersect and have different distributions, which leads to a domain shift problem when generalizing the learned model to the novel-class dataset. In this paper, we propose multi-scale task-aware structure graph modeling (MTSGM) for few-shot image recognition. We train a meta-filter learner to generate task-aware local structure filters for each scale and adaptively capture the local structures at each scale. Moreover, we introduce a novel multi-scale graph attention network (MGAT) module to model the multi-scale local structures of an image, fully exploring the correlations between different local structures at all scales of the image. Finally, we leverage the attention mechanism of the graph attention network to achieve information aggregation and propagation, aiming to obtain more representative and discriminative local structure features that integrate both local and global information. We conducted comprehensive experiments on four benchmark datasets widely adopted in FSL tasks. The experimental results demonstrate that MTSGM obtains state-of-the-art performance, which validates its effectiveness.


Pages: 13

