Attention-optimized vision-enhanced prompt learning for few-shot multi-modal sentiment analysis

被引:0
|
作者
Zhou, Zikai [1 ]
Qiao, Baiyou [1 ]
Feng, Haisong [2 ]
Han, Donghong [1 ]
Wu, Gang [1 ]
机构
[1] School of Computer Science and Engineering, Northeastern University, Shenyang
[2] School of Informatics, Xiamen University, Xiamen
基金
中国国家自然科学基金;
关键词
Few-shot learning; GCN; Multi-modal sentiment analysis; Prompt learning;
D O I
10.1007/s00521-024-10297-w
中图分类号
学科分类号
摘要
To fulfill the explosion of multi-modal data, multi-modal sentiment analysis (MSA) emerged and attracted widespread attention. Unfortunately, conventional multi-modal research relies on large-scale datasets. On the one hand, collecting and annotating large-scale datasets is challenging and resource-intensive. On the other hand, the training on large-scale datasets also increases the research cost. However, the few-shot MSA (FMSA), which is proposed recently, requires only few samples for training. Therefore, in comparison, it is more practical and realistic. There have been approaches to investigating the prompt-based method in the field of FMSA, but they have not sufficiently considered or leveraged the information specificity of visual modality. Thus, we propose a vision-enhanced prompt-based model based on graph structure to better utilize vision information for fusion and collaboration in encoding and optimizing prompt representations. Specifically, we first design an aggregation-based multi-modal attention module. Then, based on this module and the biaffine attention, we construct a syntax–semantic dual-channel graph convolutional network to optimize the encoding of learnable prompts by understanding the vision-enhanced information in semantic and syntactic knowledge. Finally, we propose a collaboration-based optimization module based on the collaborative attention mechanism, which employs visual information to collaboratively optimize prompt representations. Extensive experiments conducted on both coarse-grained and fine-grained MSA datasets have demonstrated that our model significantly outperforms the baseline models. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
引用
收藏
页码:21091 / 21105
页数:14
相关论文
共 50 条
  • [31] Pathology-Knowledge Enhanced Multi-instance Prompt Learning for Few-Shot Whole Slide Image Classification
    Qu, Linhao
    Yang, Dingkang
    Huang, Dan
    Guo, Qinhao
    Luo, Rongkui
    Zhang, Shaoting
    Wang, Xiaosong
    COMPUTER VISION - ECCV 2024, PT XI, 2025, 15069 : 196 - 212
  • [32] Few-shot Aspect Category Sentiment Analysis via Meta-learning
    Liang, Bin
    Li, Xiang
    Gui, Lin
    Fu, Yonghao
    He, Yulan
    Yang, Min
    Xu, Ruifeng
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (01)
  • [33] Multi-level adaptive few-shot learning network combined with vision transformer
    Zhu H.
    Cai X.
    Dou J.
    Gao Z.
    Zhang L.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (09) : 12477 - 12491
  • [34] MARANet: Multi-scale Adaptive Region Attention Network for Few-Shot Learning
    Chen, Jia
    Li, Xiyang
    Ou, Yangjun
    Hu, Xinrong
    Peng, Tao
    ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT I, 2024, 14495 : 415 - 426
  • [35] Few-Shot Learning Based on Dimensionally Enhanced Attention and Logit Standardization Self-Distillation
    Tang, Yuhong
    Li, Guang
    Zhang, Ming
    Li, Jianjun
    ELECTRONICS, 2024, 13 (15)
  • [36] Prototypical Network with Instance-Level Attention in Multi-Label Few-Shot Learning
    Luo S.
    Zhang R.
    Pan L.
    Wu Z.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2023, 43 (04): : 403 - 409
  • [37] FSCA: Few-Shot Learning via Embedding Adaptation with Corner Multi-Head Attention
    Xu, Rui
    Huang, Jitao
    Li, Yuqi
    Dong, Dianxin
    Liu, Shuang
    Tian, Zijing
    Ou, Zhonghong
    Song, Meina
    ELECTRONICS, 2025, 14 (01):
  • [38] Attention-enhanced corn disease diagnosis using few-shot learning and VGG16
    Rani, Ruchi
    Sahoo, Jayakrushna
    Bellamkonda, Sivaiah
    Kumar, Sumit
    METHODSX, 2025, 14
  • [39] Few-shot learning approach with multi-scale feature fusion and attention for plant disease recognition
    Lin, Hong
    Tse, Rita
    Tang, Su-Kit
    Qiang, Zhen-ping
    Pau, Giovanni
    FRONTIERS IN PLANT SCIENCE, 2022, 13
  • [40] MHA-WoML: Multi-head attention and Wasserstein-OT for few-shot learning
    Junyan Yang
    Jie Jiang
    Yanming Guo
    International Journal of Multimedia Information Retrieval, 2022, 11 : 681 - 694