Multimodal Fusion based on Information Gain for Emotion Recognition in the Wild

Cited by: 0
Authors
Ghaleb, Esam [1]
Popa, Mirela [1]
Hortal, Enrique [1]
Asteriadis, Stylianos [1]
Affiliations
[1] Maastricht Univ, Dept Data Sci & Knowledge Engn, Maastricht, Netherlands
Source
PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) | 2017
Funding
EU Horizon 2020;
Keywords
Emotion recognition; multimodal fusion; information gain; genetic algorithm; EXPRESSION;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we present a novel approach to multimodal emotion recognition on the challenging AFEW'16 dataset, which is composed of video clips labeled with the six basic emotions plus the neutral state. After a preprocessing stage, we employ different feature extraction techniques (CNN, DSIFT on the face and facial ROI, geometric, and audio-based) and encode frame-based features using Fisher vector representations. Next, we leverage the properties of each modality using different fusion schemes. Apart from early-level fusion and decision-level fusion approaches, we propose a hierarchical decision-level method based on information gain principles, and we optimize its parameters using genetic algorithms. The experimental results demonstrate the suitability of our method: we obtain 53.06% validation accuracy, surpassing the 38.81% baseline by about 14 percentage points on a challenging dataset suited to emotion recognition in the wild.
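The abstract does not say how information gain enters the fusion, so the following is only a minimal sketch under stated assumptions, not the authors' method. It assumes each modality yields class posteriors, scores each modality by the information gain of its validation predictions with respect to the true labels, and uses those scores as weights in a flat (non-hierarchical) weighted sum. The function names `entropy`, `information_gain`, and `fuse` are hypothetical, and the genetic-algorithm optimization is omitted.

```python
import numpy as np


def entropy(labels):
    """Shannon entropy (in bits) of a 1-D array of discrete labels."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return float(-np.sum(p * np.log2(p)))


def information_gain(y_true, y_pred):
    """IG = H(Y) - H(Y | Y_hat) for one modality's hard predictions.

    Assumption: this is one plausible way to score a modality; the
    abstract does not define the exact criterion.
    """
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    h_cond = 0.0
    for value in np.unique(y_pred):
        mask = y_pred == value
        # Weight each conditional entropy by how often this prediction occurs.
        h_cond += mask.mean() * entropy(y_true[mask])
    return entropy(y_true) - h_cond


def fuse(posteriors, weights):
    """Weighted sum of per-modality class posteriors, then argmax per sample.

    posteriors: list of arrays, each of shape (n_samples, n_classes).
    weights: one non-negative weight per modality, not all zero.
    """
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()
    stacked = np.stack(posteriors)  # (n_modalities, n_samples, n_classes)
    fused = np.tensordot(weights, stacked, axes=1)  # (n_samples, n_classes)
    return fused.argmax(axis=1)
```

In use, one would compute `information_gain` for each modality on held-out data and pass the resulting scores as `weights` to `fuse`. A hierarchical variant, as named in the abstract, would presumably apply such a step at more than one level, but that structure is not described here.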
Pages: 814-823
Page count: 10