Learning a Facial Expression Embedding Disentangled from Identity

被引:47
作者
Zhang, Wei [1 ]
Ji, Xianpeng [1 ]
Chen, Keyu [2 ]
Ding, Yu [1 ]
Fan, Changjie [1 ]
机构
[1] Netease Fuxi AI Lab, Virtual Human Grp, Beijing, Peoples R China
[2] Univ Sci & Technol China, Langfang, Peoples R China
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
MODEL;
D O I
10.1109/CVPR46437.2021.00669
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The facial expression analysis requires a compact and identity-ignored expression representation. In this paper, we model the expression as the deviation from the identity by a subtraction operation, extracting a continuous and identity-invariant expression embedding. We propose a Deviation Learning Network (DLN) with a pseudo-siamese structure to extract the deviation feature vector. To reduce the optimization difficulty caused by additional fully connection layers, DLN directly provides high-order polynomial to nonlinearly project the high-dimensional feature to a low-dimensional manifold. Taking label noise into account, we add a crowd layer to DLN for robust embedding extraction. Also, to achieve a more compact representation, we use hierarchical annotation for data augmentation. We evaluate our facial expression embedding on the FEC validation set. The quantitative results prove that we achieve the state-of-the-art, both in terms of fine-grained and identity-invariant property. We further conduct extensive experiments to show that our expression embedding is of high quality for expression recognition, image retrieval, and face manipulation.
引用
收藏
页码:6755 / 6764
页数:10
相关论文
共 41 条
[1]  
[Anonymous], 2017, ARXIV170208591
[2]  
[Anonymous], 2019, ARXIV190609529
[3]   A morphable model for the synthesis of 3D faces [J].
Blanz, V ;
Vetter, T .
SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194
[4]   Higher-order Integration of Hierarchical Convolutional Activations for Fine-grained Visual Categorization [J].
Cai, Sijia ;
Zuo, Wangmeng ;
Zhang, Lei .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :511-520
[5]   VGGFace2: A dataset for recognising faces across pose and age [J].
Cao, Qiong ;
Shen, Li ;
Xie, Weidi ;
Parkhi, Omkar M. ;
Zisserman, Andrew .
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :67-74
[6]   Manifold based analysis of facial expression [J].
Chang, Ya ;
Hu, Changbo ;
Feris, Rogerio ;
Turk, Matthew .
IMAGE AND VISION COMPUTING, 2006, 24 (06) :605-614
[7]  
Chen X, 2016, 30 C NEURAL INFORM P, V29
[8]   A Compact Dual Band-Notched And Dual-Polarization Antenna for Base Station [J].
Chen, Yuan-Long ;
Chu, Qing-Xin .
2019 INTERNATIONAL CONFERENCE ON MICROWAVE AND MILLIMETER WAVE TECHNOLOGY (ICMMT 2019), 2019,
[9]   StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation [J].
Choi, Yunjey ;
Choi, Minje ;
Kim, Munyoung ;
Ha, Jung-Woo ;
Kim, Sunghun ;
Choo, Jaegul .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8789-8797
[10]   Intermodal group-velocity engineering for broadband nonlinear optics [J].
Demas, Jeff ;
Rishoj, Lars ;
Liu, Xiao ;
Prabhakar, Gautam ;
Ramachandran, Siddharth .
PHOTONICS RESEARCH, 2019, 7 (01) :1-7