Towards an intelligent framework for multimodal affective data analysis

被引:136
作者
Poria, Soujanya [1 ]
Cambria, Erik [2 ]
Hussain, Amir [1 ]
Huang, Guang-Bin [3 ]
机构
[1] Univ Stirling, Sch Nat Sci, Stirling FK9 4LA, Scotland
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
[3] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
关键词
Multimodal; Multimodal sentiment analysis; Facial expressions; Speech; Text; Emotion analysis; Affective computing; EXPRESSION RECOGNITION; EMOTION RECOGNITION; FACIAL EXPRESSION; COMMON-SENSE; FACE; SEQUENCES;
D O I
10.1016/j.neunet.2014.10.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An increasingly large amount of multimodal content is posted on social media websites such as YouTube and Facebook everyday. In order to cope with the growth of such so much multimodal data, there is an urgent need to develop an intelligent multi-modal analysis framework that can effectively extract information from multiple modalities. In this paper, we propose a novel multimodal information extraction agent, which infers and aggregates the semantic and affective information associated with user-generated multimodal data in contexts such as e-learning, e-health, automatic video content tagging and human computer interaction. In particular, the developed intelligent agent adopts an ensemble feature extraction approach by exploiting the joint use of tri-modal (text, audio and video) features to enhance the multimodal information extraction process. In preliminary experiments using the eNTERFACE dataset, our proposed multi-modal system is shown to achieve an accuracy of 87.95%, outperforming the best state-of-the-art system by more than 10%, or in relative terms, a 56% reduction in error rate. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:104 / 116
页数:13
相关论文
共 94 条
[1]  
Alm Cecilia Ovesdotter, 2005, P HUM LANG TECHN C C, P579, DOI DOI 10.3115/1220575.1220648
[2]  
[Anonymous], CBMI
[3]  
[Anonymous], 2007, P 4 INT WORKSH SEM E
[4]  
[Anonymous], BMVC
[5]  
[Anonymous], P ACM C APPL COMP AC
[6]  
[Anonymous], THEORY APPL NATURAL
[7]  
[Anonymous], THESIS U ILLINOIS
[8]  
[Anonymous], 2011, P ASS COMP LING ACL
[9]  
[Anonymous], P 1 IEEE WORKSH MULT
[10]  
[Anonymous], GENERATIVE LEXICON