Multi-Attention Based Visual-Semantic Interaction for Few-Shot Learning

被引:0
|
作者
Zhao, Peng [1 ]
Wang, Yin [1 ]
Wang, Wei [2 ]
Mu, Jie [3 ]
Liu, Huiting [1 ]
Wang, Cong [2 ,4 ]
Cao, Xiaochun [2 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei, Peoples R China
[2] Shenzhen Campus Sun Yat Sen Univ, Sch Cyber Sci & Technol, Guangzhou, Peoples R China
[3] Dongbei Univ Finance & Econ, Sch Data Sci & Artificial Intelligence, Dalian, Peoples R China
[4] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-Shot Learning (FSL) aims to train a model that can generalize to recognize new classes, with each new class having only very limited training samples. Since extracting discriminative features for new classes with few samples is challenging, existing FSL methods leverage visual and semantic prior knowledge to guide discriminative feature learning. However, for meta-learning purposes, the semantic knowledge of the query set is unavailable, so their features lack discriminability. To address this problem, we propose a novel Multi-Attention based Visual-Semantic Interaction (MAVSI) approach for FSL. Specifically, we utilize spatial and channel attention mechanisms to effectively select discriminative visual features for the support set based on its ground-truth semantics while using all the support set semantics for each query set sample. Then, a relation module with class prototypes of the support set is employed to supervise and select discriminative visual features for the query set. To further enhance the discriminability of the support set, we introduce a visual-semantic contrastive learning module to promote the similarity between visual features and their corresponding semantic features. Extensive experiments on four benchmark datasets demonstrate that our proposed MAVSI could outperform existing state-of-the-art FSL methods.
引用
收藏
页码:1753 / 1761
页数:9
相关论文
共 50 条
  • [1] Hierarchical Graph Attention Network for Few-shot Visual-Semantic Learning
    Yin, Chengxiang
    Wu, Kun
    Che, Zhengping
    Jiang, Bo
    Xu, Zhiyuan
    Tang, Jian
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2157 - 2166
  • [2] Visual-Semantic Cooperative Learning for Few-Shot SAR Target Classification
    Wang, Siyuan
    Wang, Yinghua
    Zhang, Xiaoting
    Zhang, Chen
    Liu, Hongwei
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 6532 - 6550
  • [3] Multi-attention mutual information distributed framework for few-shot learning
    Wang, Zhe
    Ma, Pingchuan
    Chi, Ziqiu
    Li, Dongdong
    Yang, Hai
    Du, Wenli
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 202
  • [4] VSA: Adaptive Visual and Semantic Guided Attention on Few-Shot Learning
    Chai, Jin
    Chen, Yisheng
    Shen, Weinan
    Zhang, Tong
    Chen, C. L. Philip
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 280 - 292
  • [5] SEGA: Semantic Guided Attention on Visual Prototype for Few-Shot Learning
    Yang, Fengyuan
    Wang, Ruiping
    Chen, Xilin
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1586 - 1596
  • [6] Visual-Semantic Alignment for Few-shot Remote Sensing Scene Classification
    Li, Haojun
    Li, Linjia
    Luo, Wei
    2024 16TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, ICMLC 2024, 2024, : 411 - 417
  • [7] Multi-attention Meta Learning for Few-shot Fine-grained Image Recognition
    Zhu, Yaohui
    Liu, Chenlong
    Jiang, Shuqiang
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1090 - 1096
  • [8] Few-Shot Image and Sentence Matching via Gated Visual-Semantic Embedding
    Huang, Yan
    Long, Yang
    Wang, Liang
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8489 - 8496
  • [9] Multi-attention fusion and weighted class representation for few-shot classification
    赵文仓
    QIN Wenqian
    LI Ming
    HighTechnologyLetters, 2022, 28 (03) : 295 - 306
  • [10] Few-Shot Few-Shot Learning and the role of Spatial Attention
    Lifchitz, Yann
    Avrithis, Yannis
    Picard, Sylvaine
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2693 - 2700