Benchmarking Multimodal Sentiment Analysis

被引:31
作者
Cambria, Erik [1 ]
Hazarika, Devamanyu [2 ]
Poria, Soujanya [3 ]
Hussain, Amir [4 ]
Subramanyam, R. B. V. [2 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[2] Natl Inst Technol, Warangal, Andhra Pradesh, India
[3] Nanyang Technol Univ, Temasek Labs, Singapore, Singapore
[4] Univ Stirling, Sch Nat Sci, Stirling, Scotland
来源
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2017, PT II | 2018年 / 10762卷
关键词
Multimodal sentiment analysis; Emotion detection; Deep learning; Convolutional neural networks; EMOTION RECOGNITION;
D O I
10.1007/978-3-319-77116-8_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a deep-learning-based framework for multimodal sentiment analysis and emotion recognition. In particular, we leverage on the power of convolutional neural networks to obtain a performance improvement of 10% over the state of the art by combining visual, text and audio features. We also discuss some major issues frequently ignored in multimodal sentiment analysis research, e.g., role of speaker-independent models, importance of different modalities, and generalizability. The framework illustrates the different facets of analysis to be considered while performing multimodal sentiment analysis and, hence, serves as a new benchmark for future research in this emerging field.
引用
收藏
页码:166 / 179
页数:14
相关论文
共 28 条
[1]  
Baltrusaitis T, 2012, PROC CVPR IEEE, P2610, DOI 10.1109/CVPR.2012.6247980
[2]   IEMOCAP: interactive emotional dyadic motion capture database [J].
Busso, Carlos ;
Bulut, Murtaza ;
Lee, Chi-Chun ;
Kazemzadeh, Abe ;
Mower, Emily ;
Kim, Samuel ;
Chang, Jeannette N. ;
Lee, Sungbok ;
Narayanan, Shrikanth S. .
LANGUAGE RESOURCES AND EVALUATION, 2008, 42 (04) :335-359
[3]  
Cambria E, 2018, AAAI CONF ARTIF INTE, P1795
[4]   Affective Computing and Sentiment Analysis [J].
Cambria, Erik .
IEEE INTELLIGENT SYSTEMS, 2016, 31 (02) :102-107
[5]  
Cambria E, 2010, LECT NOTES COMPUT SC, V5967, P148
[6]   Multimodal human emotion/expression recognition [J].
Chen, LS ;
Huang, TS ;
Miyasato, T ;
Nakatsu, R .
AUTOMATIC FACE AND GESTURE RECOGNITION - THIRD IEEE INTERNATIONAL CONFERENCE PROCEEDINGS, 1998, :366-371
[7]  
Datcu D., 2008, SEMANTIC AUDIO VISUA
[8]  
De Silva LC, 1997, ICICS - PROCEEDINGS OF 1997 INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING, VOLS 1-3, P397, DOI 10.1109/ICICS.1997.647126
[9]  
Ekman P., 1974, CONT READINGS CHICAG
[10]  
Eyben F., 2010, P 18 ACM INT C MULT, P1459