Multi-Attention Based Visual-Semantic Interaction for Few-Shot Learning

被引:0
|
作者
Zhao, Peng [1 ]
Wang, Yin [1 ]
Wang, Wei [2 ]
Mu, Jie [3 ]
Liu, Huiting [1 ]
Wang, Cong [2 ,4 ]
Cao, Xiaochun [2 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei, Peoples R China
[2] Shenzhen Campus Sun Yat Sen Univ, Sch Cyber Sci & Technol, Guangzhou, Peoples R China
[3] Dongbei Univ Finance & Econ, Sch Data Sci & Artificial Intelligence, Dalian, Peoples R China
[4] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-Shot Learning (FSL) aims to train a model that can generalize to recognize new classes, with each new class having only very limited training samples. Since extracting discriminative features for new classes with few samples is challenging, existing FSL methods leverage visual and semantic prior knowledge to guide discriminative feature learning. However, for meta-learning purposes, the semantic knowledge of the query set is unavailable, so their features lack discriminability. To address this problem, we propose a novel Multi-Attention based Visual-Semantic Interaction (MAVSI) approach for FSL. Specifically, we utilize spatial and channel attention mechanisms to effectively select discriminative visual features for the support set based on its ground-truth semantics while using all the support set semantics for each query set sample. Then, a relation module with class prototypes of the support set is employed to supervise and select discriminative visual features for the query set. To further enhance the discriminability of the support set, we introduce a visual-semantic contrastive learning module to promote the similarity between visual features and their corresponding semantic features. Extensive experiments on four benchmark datasets demonstrate that our proposed MAVSI could outperform existing state-of-the-art FSL methods.
引用
收藏
页码:1753 / 1761
页数:9
相关论文
共 50 条
  • [31] A Self-Supervised Few-Shot Semantic Segmentation Method Based on Multi-Task Learning and Dense Attention Computation
    Yi, Kai
    Wang, Weihang
    Zhang, Yi
    SENSORS, 2024, 24 (15)
  • [32] LEARNING SEMANTICS-GUIDED VISUAL ATTENTION FOR FEW-SHOT IMAGE CLASSIFICATION
    Chu, Wen-Hsuan
    Wang, Yu-Chiang Frank
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2979 - 2983
  • [33] Visual driving assistance system based on few-shot learning
    Liu, Shan
    Tang, Yichao
    Tian, Ying
    Su, Hansong
    MULTIMEDIA SYSTEMS, 2023, 29 (05) : 2853 - 2863
  • [34] Visual driving assistance system based on few-shot learning
    Shan Liu
    Yichao Tang
    Ying Tian
    Hansong Su
    Multimedia Systems, 2023, 29 : 2853 - 2863
  • [35] Multiscale Attention-Based Prototypical Network For Few-Shot Semantic Segmentation
    Zhang, Yifei
    Sidibe, Desire
    Morel, Olivier
    Meriaudeau, Fabrice
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7372 - 7378
  • [36] ARNET:ATTENTION-BASED REFINEMENT NETWORK FOR FEW-SHOT SEMANTIC SEGMENTATION
    Li, Rusheng
    Liu, Hanhui
    Zhu, Yuesheng
    Bai, Zhiqiang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2238 - 2242
  • [37] Visual Classification of Malware by Few-shot Learning
    Tran, Kien
    Kubo, Masao
    Sato, Hiroshi
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB2020), 2020, : 770 - 774
  • [38] Dual Branch Multi-Level Semantic Learning for Few-Shot Segmentation
    Chen, Yadang
    Jiang, Ren
    Zheng, Yuhui
    Sheng, Bin
    Yang, Zhi-Xin
    Wu, Enhua
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1432 - 1447
  • [39] Multi-label Few-shot Learning with Semantic Inference (Student Abstract)
    Wang, Zhen
    Duan, Yiqun
    Liu, Liu
    Tao, Dacheng
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15917 - 15918
  • [40] tSF: Transformer-Based Semantic Filter for Few-Shot Learning
    Lai, Jinxiang
    Yang, Siqian
    Liu, Wenlong
    Zeng, Yi
    Huang, Zhongyi
    Wu, Wenlong
    Liu, Jun
    Gao, Bin-Bin
    Wang, Chengjie
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 1 - 19