Benchmarking Multimodal Sentiment Analysis

被引：35

作者：

Cambria, Erik ^{[1
]}

Hazarika, Devamanyu ^{[2
]}

Poria, Soujanya ^{[3
]}

Hussain, Amir ^{[4
]}

Subramanyam, R. B. V. ^{[2
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore

[2] Natl Inst Technol, Warangal, Andhra Pradesh, India

[3] Nanyang Technol Univ, Temasek Labs, Singapore, Singapore

[4] Univ Stirling, Sch Nat Sci, Stirling, Scotland

来源：

COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2017, PT II | 2018年 / 10762卷

关键词：

Multimodal sentiment analysis; Emotion detection; Deep learning; Convolutional neural networks; EMOTION RECOGNITION;

D O I：

10.1007/978-3-319-77116-8_13

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a deep-learning-based framework for multimodal sentiment analysis and emotion recognition. In particular, we leverage on the power of convolutional neural networks to obtain a performance improvement of 10% over the state of the art by combining visual, text and audio features. We also discuss some major issues frequently ignored in multimodal sentiment analysis research, e.g., role of speaker-independent models, importance of different modalities, and generalizability. The framework illustrates the different facets of analysis to be considered while performing multimodal sentiment analysis and, hence, serves as a new benchmark for future research in this emerging field.

引用

页码：166 / 179

页数：14

共 28 条

[1]

Baltrusaitis T, 2012, PROC CVPR IEEE, P2610, DOI 10.1109/CVPR.2012.6247980

[2] IEMOCAP: interactive emotional dyadic motion capture database [J].

Busso, Carlos ;

Bulut, Murtaza ;

Lee, Chi-Chun ;

Kazemzadeh, Abe ;

Mower, Emily ;

Kim, Samuel ;

Chang, Jeannette N. ;

Lee, Sungbok ;

Narayanan, Shrikanth S. .

LANGUAGE RESOURCES AND EVALUATION, 2008, 42 (04) :335-359

[3]

Cambria E, 2018, AAAI CONF ARTIF INTE, P1795

[4] Affective Computing and Sentiment Analysis [J].

Cambria, Erik .

IEEE INTELLIGENT SYSTEMS, 2016, 31 (02) :102-107

[5]

Cambria E, 2010, LECT NOTES COMPUT SC, V5967, P148

[6] Multimodal human emotion/expression recognition [J].

Chen, LS ;

Huang, TS ;

Miyasato, T ;

Nakatsu, R .

AUTOMATIC FACE AND GESTURE RECOGNITION - THIRD IEEE INTERNATIONAL CONFERENCE PROCEEDINGS, 1998, :366-371

[7]

Datcu D., 2008, SEMANTIC AUDIO VISUA

[8]

De Silva LC, 1997, ICICS - PROCEEDINGS OF 1997 INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING, VOLS 1-3, P397, DOI 10.1109/ICICS.1997.647126

[9]

Ekman P., 1974, CONT READINGS CHICAG

[10]

Eyben F., 2010, P 18 ACM INT C MULT, P1459

← 1 2 3 →