Multi-Modal Emotion Recognition Fusing Video and Audio

Cited by: 4
Authors
Xu, Chao [1]
Du, Pufeng [2]
Feng, Zhiyong [2]
Meng, Zhaopeng [1]
Cao, Tianyi [2]
Dong, Caichao [2]
Affiliations
[1] Tianjin Univ, Sch Comp Software, Tianjin 300072, Peoples R China
[2] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China
Source
APPLIED MATHEMATICS & INFORMATION SCIENCES | 2013, Vol. 7, No. 2
Funding
National Science Foundation (USA)
Keywords
Emotion Recognition; Multi-modal Fusion; HMM; Multi-layer Perceptron;
DOI
10.12785/amis/070205
Chinese Library Classification
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Emotion plays an important role in human communication. We construct a framework for multi-modal emotion recognition that fuses video and audio. Facial expression features and speech features are extracted from image sequences and speech signals, respectively. To locate and track facial feature points, we construct an Active Appearance Model for facial images covering a variety of expressions. Facial Animation Parameters, calculated from the motion of the facial feature points, serve as expression features. From each speech frame we extract short-term mean energy, fundamental frequency, and formant frequencies as speech features. An emotion classifier based on Hidden Markov Models and a Multi-layer Perceptron fuses facial expression and speech. Experiments indicate that the proposed multi-modal fusion algorithm achieves relatively high recognition accuracy, and that it performs better and is more robust than methods using video or audio alone.
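As an illustration of the per-frame speech features named in the abstract, the following is a minimal sketch of short-term mean energy and a fundamental-frequency estimate. The abstract does not say how the authors compute these, so everything here is an assumption: the function names, the frame and hop sizes, the 50 to 500 Hz search range, and the choice of a plain autocorrelation peak for the fundamental frequency are all illustrative. Formant extraction is omitted.

```python
import math

def frame_signal(samples, frame_len, hop):
    """Split samples into overlapping frames; a trailing partial frame is dropped."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]

def short_term_mean_energy(frame):
    """Mean of the squared sample values within one frame."""
    return sum(x * x for x in frame) / len(frame)

def estimate_f0_autocorr(frame, sample_rate, f_min=50.0, f_max=500.0):
    """Estimate fundamental frequency as the lag with the largest autocorrelation.

    Assumption: a simple autocorrelation peak picker, not the paper's method.
    Returns 0.0 when no lag has positive autocorrelation.
    """
    lag_min = int(sample_rate / f_max)
    lag_max = min(int(sample_rate / f_min), len(frame) - 1)
    best_lag, best_val = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        val = sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))
        if val > best_val:
            best_lag, best_val = lag, val
    return sample_rate / best_lag if best_lag else 0.0

if __name__ == "__main__":
    # Hypothetical input: one second of a 200 Hz sine sampled at 8 kHz.
    sr = 8000
    signal = [math.sin(2 * math.pi * 200 * n / sr) for n in range(sr)]
    for frame in frame_signal(signal, frame_len=400, hop=200)[:3]:
        print(short_term_mean_energy(frame), estimate_f0_autocorr(frame, sr))
```

In a pipeline like the one the abstract outlines, such per-frame values would form the speech feature sequence passed to the classifier; real speech would additionally need voicing detection and smoothing, which this sketch leaves out.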
Pages: 455-462
Page count: 8