Multimodal Fusion based on Information Gain for Emotion Recognition in the Wild

Cited by: 0
Authors
Ghaleb, Esam [1]
Popa, Mirela [1]
Hortal, Enrique [1]
Asteriadis, Stylianos [1]
Affiliations
[1] Maastricht Univ, Dept Data Sci & Knowledge Engn, Maastricht, Netherlands
Source
PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) | 2017
Funding
EU Horizon 2020;
Keywords
Emotion recognition; multimodal fusion; information gain; genetic algorithm; EXPRESSION;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we present a novel approach to multimodal emotion recognition on the challenging AFEW'16 dataset, which is composed of video clips labeled with the six basic emotions plus the neutral state. After a preprocessing stage, we employ different feature extraction techniques (CNN, DSIFT on the face and facial ROI, geometric, and audio-based) and encode frame-based features using Fisher vector representations. Next, we leverage the properties of each modality using different fusion schemes. Apart from early-level fusion and decision-level fusion, we propose a hierarchical decision-level method based on information gain principles, and we optimize its parameters using genetic algorithms. The experimental results support the suitability of our method: we obtain 53.06% validation accuracy, surpassing the 38.81% baseline by about 14 percentage points on a challenging dataset for emotion recognition in the wild.
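The abstract does not give the details of the hierarchical method or its genetic-algorithm optimization, so the following is only a minimal sketch of the general idea of weighting modalities by information gain in decision-level fusion. The function names, the two toy modalities ("face", "audio"), and all numbers are assumptions made for illustration; this is not the authors' implementation.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of discrete labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())

def information_gain(y_true, y_pred):
    """H(Y) - H(Y | prediction): how far a modality's predictions
    reduce uncertainty about the true label."""
    total = len(y_true)
    groups = {}
    for t, p in zip(y_true, y_pred):
        groups.setdefault(p, []).append(t)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(y_true) - conditional

def fuse(scores_by_modality, weights):
    """Weighted sum of per-class scores; returns the winning class index."""
    n_classes = len(next(iter(scores_by_modality.values())))
    fused = [0.0] * n_classes
    for name, scores in scores_by_modality.items():
        for c, s in enumerate(scores):
            fused[c] += weights[name] * s
    return max(range(n_classes), key=fused.__getitem__)

# Toy validation data (hypothetical): two classes, two modalities.
y_true = [0, 0, 1, 1]
val_preds = {"face": [0, 0, 1, 1], "audio": [0, 1, 0, 1]}

# Assumption: use each modality's information gain directly as its weight.
weights = {m: information_gain(y_true, p) for m, p in val_preds.items()}

# Per-class scores for one new clip (hypothetical values).
clip_scores = {"face": [0.4, 0.6], "audio": [0.9, 0.1]}
predicted_class = fuse(clip_scores, weights)
```

In this toy setup the audio predictions carry no information about the label, so their weight is zero and the fused decision follows the face modality. A fuller treatment would tune such weights, for example with a genetic algorithm as the abstract mentions.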
Pages: 814-823
Number of pages: 10