Spontaneous facial expression recognition: A robust metric learning approach

被引：66

作者：

Wan, Shaohua ^{[1
]}

Aggarwal, J. K. ^{[1
]}

机构：

[1] Univ Texas Austin, Comp Vis Res Ctr, Austin, TX 78712 USA

来源：

PATTERN RECOGNITION | 2014年 / 47卷 / 05期

关键词：

Spontaneous facial expression recognition; Metric learning; Online learning; Robust learning;

D O I：

10.1016/j.patcog.2013.11.025

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Spontaneous facial expression recognition is significantly more challenging than recognizing posed ones. We focus on two issues that are still under-addressed in this area. First, due to the inherent subtlety, the geometric and appearance features of spontaneous expressions tend to overlap with each other, making it hard for classifiers to find effective separation boundaries. Second, the training set usually contains dubious class labels which can hurt the recognition performance if no countermeasure is taken. In this paper, we propose a spontaneous expression recognition method based on robust metric learning with the aim of alleviating these two problems. In particular, to increase the discrimination of different facial expressions, we learn a new metric space in which spatially close data points have a higher probability of being in the same class. In addition, instead of using the noisy labels directly for metric learning, we define sensitivity and specificity to characterize the annotation reliability of each annotator. Then the distance metric and annotators' reliability is jointly estimated by maximizing the likelihood of the observed class labels. With the introduction of latent variables representing the true class labels, the distance metric and annotators' reliability can be iteratively solved under the Expectation Maximization framework. Comparative experiments show that our method achieves better recognition accuracy on spontaneous expression recognition, and the learned metric can be reliably transferred to recognize posed expressions. (C) 2013 Elsevier Ltd. All rights reserved.

引用

页码：1859 / 1868

页数：10

共 46 条

[1]

[Anonymous], 2011, International Journal of Wavelets Multiresolution and Information Processing, DOI DOI 10.1142/S021969130400041X

[2]

[Anonymous], 2007, Proceedings of the 20th International Conference on Neural Information Processing Systems

[3]

[Anonymous], 2013, AUT FAC GEST REC FG

[4] Recognition of facial expressions using Gabor wavelets and learning vector quantization [J].

Bashyal, Shishir ;

Venayagamoorthy, Ganesh K. .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2008, 21 (07) :1056-1064

[5]

Boyd S.P, 2004, Convex optimization, DOI [DOI 10.1017/CBO9780511804441, 10.1017/CBO9780511804441]

[6] LIBSVM: A Library for Support Vector Machines [J].

Chang, Chih-Chung ;

Lin, Chih-Jen .

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)

[7] Hybrid-boost learning for multi-pose face detection and facial expression recognition [J].

Chen, Hsiuao-Ying ;

Huang, Chung-Lin ;

Fu, Chih-Ming .

PATTERN RECOGNITION, 2008, 41 (03) :1173-1185

[8] Active appearance models [J].

Cootes, TF ;

Edwards, GJ ;

Taylor, CJ .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (06) :681-685

[9]

Das S., 2012, INT J COMPUT APPL, V45, P11

[10] COMPLETE DISCRETE 2-D GABOR TRANSFORMS BY NEURAL NETWORKS FOR IMAGE-ANALYSIS AND COMPRESSION [J].

DAUGMAN, JG .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (07) :1169-1179

← 1 2 3 4 5 →