Confidence-Weighted Local Expression Predictions for Occlusion Handling in Expression Recognition and Action Unit Detection

被引:41
作者
Dapogny, Arnaud [1 ]
Bailly, Kevin [1 ]
Dubuisson, Severine [1 ]
机构
[1] Sorbonne Univ, UPMC Univ Paris 06, CNRS, UMR 7222, F-75005 Paris, France
关键词
Facial expressions; Action unit; Random forest; Occlusions; Autoencoder; Real-time; FACE; ALIGNMENT;
D O I
10.1007/s11263-017-1010-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fully-automatic facial expression recognition (FER) is a key component of human behavior analysis. Performing FER from still images is a challenging task as it involves handling large interpersonal morphological differences, and as partial occlusions can occasionally happen. Furthermore, labelling expressions is a time-consuming process that is prone to subjectivity, thus the variability may not be fully covered by the training data. In this work, we propose to train random forests upon spatially-constrained random local subspaces of the face. The output local predictions form a categorical expression-driven high-level representation that we call local expression predictions (LEPs). LEPs can be combined to describe categorical facial expressions as well as action units (AUs). Furthermore, LEPs can be weighted by confidence scores provided by an autoencoder network. Such network is trained to locally capture the manifold of the non-occluded training data in a hierarchical way. Extensive experiments show that the proposed LEP representation yields high descriptive power for categorical expressions and AU occurrence prediction, and leads to interesting perspectives towards the design of occlusion-robust and confidence-aware FER systems.
引用
收藏
页码:255 / 271
页数:17
相关论文
共 51 条
  • [21] Dhall A., 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), P2106, DOI 10.1109/ICCVW.2011.6130508
  • [22] CONSTANTS ACROSS CULTURES IN FACE AND EMOTION
    EKMAN, P
    FRIESEN, WV
    [J]. JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1971, 17 (02) : 124 - &
  • [23] Discriminative Shared Gaussian Processes for Multiview and View-Invariant Facial Expression Recognition
    Eleftheriadis, Stefanos
    Rudovic, Ognjen
    Pantic, Maja
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (01) : 189 - 204
  • [24] Occlusion Coherence: Localizing Occluded Faces with a Hierarchical Deformable Part Model
    Ghiasi, Golnaz
    Fowlkes, Charless C.
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1899 - 1906
  • [25] Greenwald M.K., 1989, Journal of Psychophysiology, V3, P51
  • [26] Hayat M, 2012, C HUM SYST INTERACT, P43, DOI 10.1109/HSI.2012.16
  • [27] Towards a dynamic expression recognition system under facial occlusion
    Huang, Xiaohua
    Zhao, Guoying
    Zheng, Wenming
    Pietikainen, Matti
    [J]. PATTERN RECOGNITION LETTERS, 2012, 33 (16) : 2181 - 2191
  • [28] Jeni L.A., 2015, FG
  • [29] An analysis of facial expression recognition under partial facial image occlusion
    Kotsia, Irene
    Buciu, Loan
    Pitas, Loannis
    [J]. IMAGE AND VISION COMPUTING, 2008, 26 (07) : 1052 - 1067
  • [30] Linusson H., 2013, Multi-output random forests