Confidence-Weighted Local Expression Predictions for Occlusion Handling in Expression Recognition and Action Unit Detection

被引：43

作者：

Dapogny, Arnaud ^{[1
]}

Bailly, Kevin ^{[1
]}

Dubuisson, Severine ^{[1
]}

机构：

[1] Sorbonne Univ, UPMC Univ Paris 06, CNRS, UMR 7222, F-75005 Paris, France

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2018年 / 126卷 / 2-4期

关键词：

Facial expressions; Action unit; Random forest; Occlusions; Autoencoder; Real-time; FACE; ALIGNMENT;

D O I：

10.1007/s11263-017-1010-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Fully-automatic facial expression recognition (FER) is a key component of human behavior analysis. Performing FER from still images is a challenging task as it involves handling large interpersonal morphological differences, and as partial occlusions can occasionally happen. Furthermore, labelling expressions is a time-consuming process that is prone to subjectivity, thus the variability may not be fully covered by the training data. In this work, we propose to train random forests upon spatially-constrained random local subspaces of the face. The output local predictions form a categorical expression-driven high-level representation that we call local expression predictions (LEPs). LEPs can be combined to describe categorical facial expressions as well as action units (AUs). Furthermore, LEPs can be weighted by confidence scores provided by an autoencoder network. Such network is trained to locally capture the manifold of the non-occluded training data in a hierarchical way. Extensive experiments show that the proposed LEP representation yields high descriptive power for categorical expressions and AU occurrence prediction, and leads to interesting perspectives towards the design of occlusion-robust and confidence-aware FER systems.

引用

页码：255 / 271

页数：17

共 51 条

[21]

Dhall A., 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), P2106, DOI 10.1109/ICCVW.2011.6130508

[22] CONSTANTS ACROSS CULTURES IN FACE AND EMOTION [J].

EKMAN, P ;

FRIESEN, WV .

JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1971, 17 (02) :124-&

[23] Discriminative Shared Gaussian Processes for Multiview and View-Invariant Facial Expression Recognition [J].

Eleftheriadis, Stefanos ;

Rudovic, Ognjen ;

Pantic, Maja .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (01) :189-204

[24] Occlusion Coherence: Localizing Occluded Faces with a Hierarchical Deformable Part Model [J].

Ghiasi, Golnaz ;

Fowlkes, Charless C. .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1899-1906

[25]

Greenwald M.K., 1989, Journal of Psychophysiology, V3, P51

[26]

Hayat M, 2012, C HUM SYST INTERACT, P43, DOI 10.1109/HSI.2012.16

[27] Towards a dynamic expression recognition system under facial occlusion [J].

Huang, Xiaohua ;

Zhao, Guoying ;

Zheng, Wenming ;

Pietikainen, Matti .

PATTERN RECOGNITION LETTERS, 2012, 33 (16) :2181-2191

[28]

Jeni L.A., 2015, FG

[29] An analysis of facial expression recognition under partial facial image occlusion [J].

Kotsia, Irene ;

Buciu, Loan ;

Pitas, Loannis .

IMAGE AND VISION COMPUTING, 2008, 26 (07) :1052-1067

[30]

Linusson H., 2013, Multi-output random forests

← 1 2 3 4 5 6 →