Multi-Attention Based Visual-Semantic Interaction for Few-Shot Learning

被引:0
|
作者
Zhao, Peng [1 ]
Wang, Yin [1 ]
Wang, Wei [2 ]
Mu, Jie [3 ]
Liu, Huiting [1 ]
Wang, Cong [2 ,4 ]
Cao, Xiaochun [2 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei, Peoples R China
[2] Shenzhen Campus Sun Yat Sen Univ, Sch Cyber Sci & Technol, Guangzhou, Peoples R China
[3] Dongbei Univ Finance & Econ, Sch Data Sci & Artificial Intelligence, Dalian, Peoples R China
[4] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-Shot Learning (FSL) aims to train a model that can generalize to recognize new classes, with each new class having only very limited training samples. Since extracting discriminative features for new classes with few samples is challenging, existing FSL methods leverage visual and semantic prior knowledge to guide discriminative feature learning. However, for meta-learning purposes, the semantic knowledge of the query set is unavailable, so their features lack discriminability. To address this problem, we propose a novel Multi-Attention based Visual-Semantic Interaction (MAVSI) approach for FSL. Specifically, we utilize spatial and channel attention mechanisms to effectively select discriminative visual features for the support set based on its ground-truth semantics while using all the support set semantics for each query set sample. Then, a relation module with class prototypes of the support set is employed to supervise and select discriminative visual features for the query set. To further enhance the discriminability of the support set, we introduce a visual-semantic contrastive learning module to promote the similarity between visual features and their corresponding semantic features. Extensive experiments on four benchmark datasets demonstrate that our proposed MAVSI could outperform existing state-of-the-art FSL methods.
引用
收藏
页码:1753 / 1761
页数:9
相关论文
共 50 条
  • [21] LEARNING WITH MEMORY FOR FEW-SHOT SEMANTIC SEGMENTATION
    Lu, Hongchao
    Wei, Chao
    Deng, Zhidong
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 629 - 633
  • [22] Relational Action Bank with Semantic-Visual Attention for Few-Shot Action Recognition
    Liang, Haoming
    Du, Jinze
    Zhang, Hongchen
    Han, Bing
    Ma, Yan
    FUTURE INTERNET, 2023, 15 (03)
  • [23] Multi-level Attention Feature Network for Few-shot Learning
    Wang R.
    Han M.
    Yang J.
    Xue L.
    Hu M.
    Yang, Juan (yangjuan@hfut.edu.cn), 1600, Science Press (42): : 772 - 778
  • [24] Multi-level Attention Feature Network for Few-shot Learning
    Wang Ronggui
    Han Mengya
    Yang Juan
    Xue Lixia
    Hu Min
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (03) : 772 - 778
  • [25] Multi Aspect Attention Online Integrated Distillation For Few-shot Learning
    Wang, Cailing
    Wei, Qingchen
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4047 - 4051
  • [26] Semantic-Guided Multi-Attention Localization for Zero-Shot Learning
    Zhu, Yizhe
    Xie, Jianwen
    Tang, Zhiqiang
    Peng, Xi
    Elgammal, Ahmed
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [27] Spatial Attention Network for Few-Shot Learning
    He, Xianhao
    Qiao, Peng
    Dou, Yong
    Niu, Xin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II, 2019, 11728 : 567 - 578
  • [28] Reinforced Attention for Few-Shot Learning and Beyond
    Hong, Jie
    Fang, Pengfei
    Li, Weihao
    Zhang, Tong
    Simon, Christian
    Harandi, Mehrtash
    Petersson, Lars
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 913 - 923
  • [29] Attention Relational Network for Few-Shot Learning
    Shuai, Jia
    Chen, JiaMing
    Yang, Meng
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING, PT II, 2019, 11936 : 163 - 174
  • [30] Multi-attention Network for One Shot Learning
    Wang, Peng
    Liu, Lingqiao
    Shen, Chunhua
    Huang, Zi
    van den Hengel, Anton
    Shen, Heng Tao
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6212 - 6220