Multimodal Emotion Recognition with Automatic Peak Frame Selection

被引:0
作者
Zhalehpour, Sara [1 ]
Akhtar, Zahid [2 ]
Erdem, Cigdem Eroglu [1 ]
机构
[1] Bahcesehir Univ, Dept Elect & Elect Engn, Istanbul, Turkey
[2] Univ Udine, Dept Math & Comp Sci, Udine, Italy
来源
2014 IEEE INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA 2014) | 2014年
关键词
multimodal emotion recognition; peak frame selection; decision level fusion; affective computing; FUSION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present an effective framework for multimodal emotion recognition based on a novel approach for automatic peak frame selection from audio-visual video sequences. Given a video with an emotional expression, peak frames are the ones at which the emotion is at its apex. The objective of peak frame selection is to make the training process for the automatic emotion recognition system easier by summarizing the expressed emotion over a video sequence. The main steps of the proposed framework consists of extraction of video and audio features based on peak frame selection, unimodal classification and decision level fusion of audio and visual results. We evaluated the performance of our approach on eNTERFACE'05 audio-visual database containing six basic emotional classes. Experimental results demonstrate the effectiveness and superiority of the proposed system over other methods in the literature.
引用
收藏
页码:116 / 121
页数:6
相关论文
共 24 条
[1]  
[Anonymous], 2000, STUDIES
[2]  
[Anonymous], 2013, MULTIMEDIA EXPO ICME
[3]   Multimodal fusion for multimedia analysis: a survey [J].
Atrey, Pradeep K. ;
Hossain, M. Anwar ;
El Saddik, Abdulmotaleb ;
Kankanhalli, Mohan S. .
MULTIMEDIA SYSTEMS, 2010, 16 (06) :345-379
[4]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[5]  
Datcu Dragos., 2011, P 12 INT C COMP SYST, P122
[6]   Survey on speech emotion recognition: Features, classification schemes, and databases [J].
El Ayadi, Moataz ;
Kamel, Mohamed S. ;
Karray, Fakhri .
PATTERN RECOGNITION, 2011, 44 (03) :572-587
[7]   Automatic facial expression analysis: a survey [J].
Fasel, B ;
Luettin, J .
PATTERN RECOGNITION, 2003, 36 (01) :259-275
[8]  
Gajsek Rok, 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P4133, DOI 10.1109/ICPR.2010.1005
[9]   RASTA Processing of Speech [J].
Hermansky, Hynek ;
Morgan, Nelson .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04) :578-589
[10]  
Kroon B., 2008, Proceedings of the 8th IEEE International Conference on Automatic Face Gesture Recognition, P1