Exploring Multimodal Visual Features for Continuous Affect Recognition

被引:23
作者
Sun, Bo [1 ]
Cao, Siming [1 ]
Li, Liandong [1 ]
He, Jun [1 ]
Yu, Lejun [1 ]
机构
[1] Beijing Normal Univ, Coll Informat Sci & Technol, Beijing 100875, Peoples R China
来源
PROCEEDINGS OF THE 6TH INTERNATIONAL WORKSHOP ON AUDIO/VISUAL EMOTION CHALLENGE (AVEC'16) | 2016年
关键词
Continuous Emotion Recognition; CNN; Multimodal Features; SVR; Residual Network;
D O I
10.1145/2988257.2988270
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents our work in the Emotion Sub-Challenge of the 6th Audio/Visual Emotion Challenge and Workshop (AVEC 2016), whose goal is to explore utilizing audio, visual and physiological signals to continuously predict the value of the emotion dimensions (arousal and valence). As visual features are very important in emotion recognition, we try a variety of handcrafted and deep visual features. For each video clip, besides the baseline features, we extract multi-scale Dense SIFT features (MSDF), and some types of Convolutional neural networks (CNNs) features to recognize the expression phases of the current frame. We train linear Support Vector Regression (SVR) for every kind of features on the RECOLA dataset. Multimodal fusion of these modalities is then performed with a multiple linear regression model. The final Concordance Correlation Coefficient (CCC) we gained on the development set are 0.824 for arousal, and 0.718 for valence; and on the test set are 0.683 for arousal and 0.642 for valence.
引用
收藏
页码:83 / 88
页数:6
相关论文
共 32 条
  • [1] Local Gabor Binary Patterns from Three Orthogonal Planes for Automatic Facial Expression Recognition
    Almaev, Timur R.
    Valstar, Michel F.
    [J]. 2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, : 356 - 361
  • [2] THIN SLICES OF EXPRESSIVE BEHAVIOR AS PREDICTORS OF INTERPERSONAL CONSEQUENCES - A METAANALYSIS
    AMBADY, N
    ROSENTHAL, R
    [J]. PSYCHOLOGICAL BULLETIN, 1992, 111 (02) : 256 - 274
  • [3] [Anonymous], P 2 INT WORKSH EM RE
  • [4] [Anonymous], 2010, P 18 ACM INT C MULT, DOI [10.1145/1873951.1874249, 10.1145/1873951.1874249.2]
  • [5] [Anonymous], 2013, Proceedings of the 21st ACM International Conference on Multimedia, DOI DOI 10.1145/2502081.2502224
  • [6] [Anonymous], P 2015 ACM INT C MUL
  • [7] [Anonymous], 2016, AVEC 2016 DEPR MOOD
  • [8] [Anonymous], 1997, Neural Computation
  • [9] [Anonymous], ADV NEURAL INF PROCE
  • [10] [Anonymous], IEEE T AFF IN PRESS