Feature and Decision Level Audio-visual Data Fusion in Emotion Recognition Problem

被引:0
作者
Sidorov, Maxim [1 ]
Sopov, Evgenii [2 ]
Ivanov, Ilia [2 ]
Minker, Wolfgang [1 ]
机构
[1] Univ Ulm, Inst Commun Engn, Ulm, Germany
[2] Siberian State Aerosp Univ, Dept Syst Anal & Operat Res, Krasnoyarsk, Russia
来源
ICIMCO 2015 PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL. 2 | 2015年
关键词
Emotion Recognition; Speech; Vision; PCA; Neural Network; Human-Computer Interaction (HCI); Feature Level Fusion; Decision Level Fusion;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The speech-based emotion recognition problem has already been investigated by many authors, and reasonable results have been achieved. This article focuses on applying audio-visual data fusion approach to emotion recognition. Two state-of-the-art classification algorithms were applied to one audio and three visual feature datasets. Feature level data fusion was applied to build a multimodal emotion classification system, which helped increase emotion classification accuracy by 4% compared to the best accuracy achieved by unimodal systems. The class precisions achieved by applying algorithms on unimodal and multimodal datasets helped to reveal that different data-classifier combinations are good at recognizing certain emotions. These data-classifier combinations were fused on the decision level using several approaches, which still helped increase the accuracy by 3% compared to the best accuracy achieved by feature level fusion.
引用
收藏
页码:246 / 251
页数:6
相关论文
共 9 条
[1]  
[Anonymous], 2009, P INT C AUD VIS SPEE
[2]  
Busso C., 2004, ANAL EMOTION RECOGNI
[3]  
Cruz A., 2012, P 21 INT C PATT REC
[4]  
Eyben F, 2010, P 18 ACM INT C MULT, P1459, DOI [DOI 10.1145/1873951.1874246, 10.1145/1873951.1874246]
[5]  
Kahou S. E., 2013, ICMI 13
[6]  
Platt J., 1998, SEQUENTIAL MINIMAL O
[7]   Human emotion recognition from videos using spatio-temporal and audio features [J].
Rashid, Munaf ;
Abu-Bakar, S. A. R. ;
Mokji, Musa .
VISUAL COMPUTER, 2013, 29 (12) :1269-1275
[8]  
Sariyanidi E., 2013, BMVC 13
[9]  
Soleymani Mohammad., 2012, IEEE Transactions on Affective Computing (TAC), V3