Attention-optimized vision-enhanced prompt learning for few-shot multi-modal sentiment analysis

被引：0

作者：

Zhou, Zikai ^{[1
]}

Qiao, Baiyou ^{[1
]}

Feng, Haisong ^{[2
]}

Han, Donghong ^{[1
]}

Wu, Gang ^{[1
]}

机构：

[1] School of Computer Science and Engineering, Northeastern University, Shenyang

[2] School of Informatics, Xiamen University, Xiamen

来源：

Neural Computing and Applications | 2024年 / 36卷 / 33期

基金：

中国国家自然科学基金;

关键词：

Few-shot learning; GCN; Multi-modal sentiment analysis; Prompt learning;

D O I：

10.1007/s00521-024-10297-w

中图分类号：

学科分类号：

摘要：

To fulfill the explosion of multi-modal data, multi-modal sentiment analysis (MSA) emerged and attracted widespread attention. Unfortunately, conventional multi-modal research relies on large-scale datasets. On the one hand, collecting and annotating large-scale datasets is challenging and resource-intensive. On the other hand, the training on large-scale datasets also increases the research cost. However, the few-shot MSA (FMSA), which is proposed recently, requires only few samples for training. Therefore, in comparison, it is more practical and realistic. There have been approaches to investigating the prompt-based method in the field of FMSA, but they have not sufficiently considered or leveraged the information specificity of visual modality. Thus, we propose a vision-enhanced prompt-based model based on graph structure to better utilize vision information for fusion and collaboration in encoding and optimizing prompt representations. Specifically, we first design an aggregation-based multi-modal attention module. Then, based on this module and the biaffine attention, we construct a syntax–semantic dual-channel graph convolutional network to optimize the encoding of learnable prompts by understanding the vision-enhanced information in semantic and syntactic knowledge. Finally, we propose a collaboration-based optimization module based on the collaborative attention mechanism, which employs visual information to collaboratively optimize prompt representations. Extensive experiments conducted on both coarse-grained and fine-grained MSA datasets have demonstrated that our model significantly outperforms the baseline models. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.

引用

页码：21091 / 21105

页数：14

共 50 条

[31] Pathology-Knowledge Enhanced Multi-instance Prompt Learning for Few-Shot Whole Slide Image Classification
Qu, Linhao
Yang, Dingkang
Huang, Dan
Guo, Qinhao
Luo, Rongkui
Zhang, Shaoting
Wang, Xiaosong
COMPUTER VISION - ECCV 2024, PT XI, 2025, 15069 : 196 - 212
[32] Few-shot Aspect Category Sentiment Analysis via Meta-learning
Liang, Bin
Li, Xiang
Gui, Lin
Fu, Yonghao
He, Yulan
Yang, Min
Xu, Ruifeng
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (01)
[33] Multi-level adaptive few-shot learning network combined with vision transformer
Zhu H.
Cai X.
Dou J.
Gao Z.
Zhang L.
Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (09) : 12477 - 12491
[34] MARANet: Multi-scale Adaptive Region Attention Network for Few-Shot Learning
Chen, Jia
Li, Xiyang
Ou, Yangjun
Hu, Xinrong
Peng, Tao
ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT I, 2024, 14495 : 415 - 426
[35] Few-Shot Learning Based on Dimensionally Enhanced Attention and Logit Standardization Self-Distillation
Tang, Yuhong
Li, Guang
Zhang, Ming
Li, Jianjun
ELECTRONICS, 2024, 13 (15)
[36] Prototypical Network with Instance-Level Attention in Multi-Label Few-Shot Learning
Luo S.
Zhang R.
Pan L.
Wu Z.
Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2023, 43 (04): : 403 - 409
[37] FSCA: Few-Shot Learning via Embedding Adaptation with Corner Multi-Head Attention
Xu, Rui
Huang, Jitao
Li, Yuqi
Dong, Dianxin
Liu, Shuang
Tian, Zijing
Ou, Zhonghong
Song, Meina
ELECTRONICS, 2025, 14 (01):
[38] Attention-enhanced corn disease diagnosis using few-shot learning and VGG16
Rani, Ruchi
Sahoo, Jayakrushna
Bellamkonda, Sivaiah
Kumar, Sumit
METHODSX, 2025, 14
[39] Few-shot learning approach with multi-scale feature fusion and attention for plant disease recognition
Lin, Hong
Tse, Rita
Tang, Su-Kit
Qiang, Zhen-ping
Pau, Giovanni
FRONTIERS IN PLANT SCIENCE, 2022, 13
[40] MHA-WoML: Multi-head attention and Wasserstein-OT for few-shot learning
Junyan Yang
Jie Jiang
Yanming Guo
International Journal of Multimedia Information Retrieval, 2022, 11 : 681 - 694

← 1 2 3 4 5 →