Sentiment Knowledge Enhanced Self-supervised Learning for Multimodal Sentiment Analysis

被引：0

作者：

Qian, Fan ^{[1
]}

Han, Jiqing ^{[1
]}

He, Yongjun ^{[1
]}

Zheng, Tieran ^{[1
]}

Zheng, Guibin ^{[1
]}

机构：

[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Peoples R China

来源：

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023) | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multimodal Sentiment Analysis (MSA) has made great progress that benefits from extraordinary fusion scheme. However, there is a lack of labeled data, resulting in severe overfitting and poor generalization for supervised models applied in this field. In this paper, we propose Sentiment Knowledge Enhanced Self-supervised Learning (SKESL) to capture common sentimental patterns in unlabeled videos, which facilitates further learning on limited labeled data. Specifically, with the help of sentiment knowledge and non-verbal behavior, SKESL conducts sentiment word masking and predicts fine-grained word sentiment intensity, so as to embed sentiment information at the word level into pre-trained multimodal representation. In addition, a non-verbal injection method is also proposed to integrate non-verbal information into the word semantics. Experiments on two standard benchmarks of MSA clearly show that SKESL significantly outperforms the baseline, and achieves new State-Of-The-Art (SOTA) results.

引用

页码：12966 / 12978

页数：13

共 57 条

[1]

Akhtar MS, 2019, ARXIV

[2] Emotion Recognition in Speech using Cross-Modal Transfer in the Wild [J].

Albanie, Samuel ;

Nagrani, Arsha ;

Vedaldi, Andrea ;

Zisserman, Andrew .

PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, :292-301

[3]

[Anonymous], 2020, ADV NEURAL INFORM PR, V33, P1877, DOI DOI 10.5555/3495724.3495883

[4] OpenFace 2.0: Facial Behavior Analysis Toolkit [J].

Baltrusaitis, Tadas ;

Zadeh, Amir ;

Lim, Yao Chong ;

Morency, Louis-Philippe .

PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :59-66

[5]

Chakrabarty Tuhin, 2020, ARXIV

[6]

Chauhan Dushyant Singh, 2020, ACL

[7]

Chen Minping, 2020, COLING

[8]

Chung Joon Son, 2018, arXiv

[9] Pre-Training With Whole Word Masking for Chinese BERT [J].

Cui, Yiming ;

Che, Wanxiang ;

Liu, Ting ;

Qin, Bing ;

Yang, Ziqing .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 :3504-3514

[10]

Dai Wenliang, 2021, arXiv

← 1 2 3 4 5 6 →