Multimodal Fusion based on Information Gain for Emotion Recognition in the Wild

Cited by: 0

Authors:

Ghaleb, Esam [1]

Popa, Mirela [1]

Hortal, Enrique [1]

Asteriadis, Stylianos [1]

Affiliations:

[1] Maastricht University, Department of Data Science and Knowledge Engineering, Maastricht, Netherlands

Source:

PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) | 2017

Funding:

European Union Horizon 2020

Keywords:

Emotion recognition; multimodal fusion; information gain; genetic algorithm; EXPRESSION;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we present a novel approach to multimodal emotion recognition on the challenging AFEW'16 dataset, which is composed of video clips labeled with the six basic emotions plus the neutral state. After a preprocessing stage, we employ different feature extraction techniques (CNN, DSIFT on the face and facial ROI, geometric, and audio-based) and encode frame-based features using Fisher vector representations. Next, we leverage the properties of each modality using different fusion schemes. Apart from early-level fusion and decision-level fusion, we propose a hierarchical decision-level method based on information gain principles, and we optimize its parameters using genetic algorithms. The experimental results support the suitability of our method: we obtain 53.06% validation accuracy, surpassing the baseline of 38.81% by about 14 percentage points (14.25) on a challenging dataset suited to emotion recognition in the wild.
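The abstract does not spell out how information gain enters the fusion, so the following is only a minimal sketch of one plausible reading, not the authors' method. It assumes that each modality's information gain on validation data is used directly as its weight in a flat, weighted decision-level fusion; the hierarchical structure and the genetic-algorithm tuning are left out. The function names (`entropy`, `information_gain`, `fuse`), the two modalities, and all numbers are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of discrete labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(true_labels, predicted_labels):
    """H(Y) - H(Y | prediction): how much a modality's predictions
    reduce uncertainty about the true emotion label."""
    total = len(true_labels)
    groups = {}
    for y, p in zip(true_labels, predicted_labels):
        groups.setdefault(p, []).append(y)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(true_labels) - conditional


def fuse(scores_by_modality, weights):
    """Weighted sum of per-class scores across modalities; returns the top class."""
    classes = next(iter(scores_by_modality.values())).keys()
    fused = {
        c: sum(weights[m] * scores[c] for m, scores in scores_by_modality.items())
        for c in classes
    }
    return max(fused, key=fused.get)


# Toy validation data, invented for illustration only.
y_true = ["happy", "sad", "happy", "sad"]
val_preds = {
    "face": ["happy", "sad", "happy", "sad"],   # predictions determine the label
    "audio": ["happy", "happy", "sad", "sad"],  # predictions carry no information
}
# Assumption: the information gain of each modality is used as its fusion weight.
weights = {m: information_gain(y_true, p) for m, p in val_preds.items()}

# Per-class scores of each modality for one hypothetical test clip.
clip_scores = {
    "face": {"happy": 0.6, "sad": 0.4},
    "audio": {"happy": 0.1, "sad": 0.9},
}
prediction = fuse(clip_scores, weights)
```

In this toy case the audio modality receives zero weight, so the fused decision follows the face modality, whereas equal weights would let the confident but uninformative audio score dominate. A genetic algorithm, as mentioned in the abstract, could then search over such weights or other fusion parameters, but that step is not sketched here.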


Pages: 814-823

Page count: 10
