Hard negative generation for identity-disentangled facial expression recognition

被引：76

作者：

Liu, Xiaofeng ^{[1
,2
,3
]}

Kumar, B. V. K. Vijaya ^{[3
,4
]}

Jia, Ping ^{[1
,2
]}

You, Jane ^{[1
,2
,5
]}

机构：

[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, Changchun, Jilin, Peoples R China

[2] Univ Chinese Acad Sci, Beijing, Peoples R China

[3] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA

[4] Carnegie Mellon Univ Africa, Kigali, Rwanda

[5] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China

来源：

PATTERN RECOGNITION | 2019年 / 88卷

关键词：

Hard negative generation; Adaptive metric learning; Face normalization; Facial expression recognition;

D O I：

10.1016/j.patcog.2018.11.001

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Various factors such as identity-specific attributes, pose, illumination and expression affect the appearance of face images. Disentangling the identity-specific factors is potentially beneficial for facial expression recognition (FER). Existing image-based FER systems either use hand-crafted or learned features to represent a single face image. In this paper, we propose a novel FER framework, named identity disentangled facial expression recognition machine (IDFERM), in which we untangle the identity from a query sample by exploiting its difference from its references (e.g., its mined or generated frontal and neutral normalized faces). We demonstrate a possible 'recognition via generation' scheme which consists of a novel hard negative generation (HNG) network and a generalized radial metric learning (RML) network. For FER, generated normalized faces are used as hard negative samples for metric learning. The difficulty of threshold validation and anchor selection are alleviated in RML and its distance comparisons are fewer than those of traditional deep metric learning methods. The expression representations of RML achieve superior performance on the CK +, MMI and Oulu-CASIA datasets, given a single query image for testing. (C) 2018 Elsevier Ltd. All rights reserved.

引用

页码：1 / 12

页数：12

共 47 条

[1]

Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265

[2]

[Anonymous], 2003, EXPLORING ARTIFICIAL

[3]

[Anonymous], 2006, Pattern Recognition and Machine Learning

[4]

[Anonymous], 2005, The New Handbook of Methods in Nonverbal Behavior Research

[5]

[Anonymous], 2016, NIPS 16 P 30 INT C N, DOI DOI 10.5555/3157096.3157304

[6]

[Anonymous], 2017, CVPR

[7] Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution [J].

Barsoum, Emad ;

Zhang, Cha ;

Ferrer, Cristian Canton ;

Zhang, Zhengyou .

ICMI'16: PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2016, :279-283

[8] Learning a similarity metric discriminatively, with application to face verification [J].

Chopra, S ;

Hadsell, R ;

LeCun, Y .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :539-546

[9] Selective Transfer Machine for Personalized Facial Expression Analysis [J].

Chu, Wen-Sheng ;

De la Torre, Fernando ;

Cohn, Jeffrey F. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (03) :529-545

[10] From one to many: Pose-Aware Metric Learning for single-sample face recognition [J].

Deng, Weihong ;

Hu, Jiani ;

Wu, Zhongjun ;

Guo, Jun .

PATTERN RECOGNITION, 2018, 77 :426-437

← 1 2 3 4 5 →