Enhancing Face Recognition With Detachable Self-Supervised Bypass Networks

被引:5
作者
He, Mingjie [1 ,2 ]
Zhang, Jie [1 ,2 ,3 ]
Shan, Shiguang [1 ,2 ]
Chen, Xilin [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc Chinese Acad Sci, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, Inst Intelligent Comp Technol, Suzhou 215000, Peoples R China
关键词
Face recognition; Task analysis; Three-dimensional displays; Training; Supervised learning; Self-supervised learning; Image reconstruction; bypass enhanced representation learning; 3D reconstruction bypass; blind inpainting bypass;
D O I
10.1109/TIP.2024.3364067
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Attributed to the development of deep networks and abundant data, automatic face recognition (FR) has quickly reached human-level capacity in the past few years. However, the FR problem is not perfectly solved in case of large poses and uncontrolled occlusions. In this paper, we propose a novel bypass enhanced representation learning (BERL) method to improve face recognition under unconstrained scenarios. The proposed method integrates self-supervised learning and supervised learning together by attaching two auxiliary bypasses, a 3D reconstruction bypass and a blind inpainting bypass, to assist robust feature learning for face recognition. Among them, the 3D reconstruction bypass enforces the face recognition network to encode pose independent 3D facial information, which enhances the robustness to various poses. The blind inpainting bypass enforces the face recognition network to capture more facial context information for face inpainting, which enhances the robustness to occlusions. The whole framework is trained in end-to-end manner with two self-supervised tasks above and the classic supervised face identification task. During inference, the two auxiliary bypasses can be detached from the face recognition network, avoiding any additional computational overhead. Extensive experimental results on various face recognition benchmarks show that, without any cost of extra annotations and computations, our method outperforms state-of-the-art methods. Moreover, the learnt representations can also well generalize to other face-related downstream tasks such as the facial attribute recognition with limited labeled data.
引用
收藏
页码:1588 / 1599
页数:12
相关论文
共 52 条
[1]  
An X., 2020, arXiv
[2]   Blind inpainting using the fully convolutional neural network [J].
Cai, Nian ;
Su, Zhenghang ;
Lin, Zhineng ;
Wang, Han ;
Yang, Zhijing ;
Ling, Bingo Wing-Kuen .
VISUAL COMPUTER, 2017, 33 (02) :249-261
[3]   Deep Clustering for Unsupervised Learning of Visual Features [J].
Caron, Mathilde ;
Bojanowski, Piotr ;
Joulin, Armand ;
Douze, Matthijs .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :139-156
[4]   Data Uncertainty Learning in Face Recognition [J].
Chang, Jie ;
Lan, Zhonghao ;
Cheng, Changmao ;
Wei, Yichen .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5709-5718
[5]  
Chen Ting, 2019, ICML
[6]  
Chen XL, 2020, Arxiv, DOI arXiv:2003.04297
[7]   ArcFace: Additive Angular Margin Loss for Deep Face Recognition [J].
Deng, Jiankang ;
Guo, Jia ;
Xue, Niannan ;
Zafeiriou, Stefanos .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4685-4694
[8]  
DeVries T, 2017, Arxiv, DOI arXiv:1708.04552
[9]   Fast-GANFIT: Generative Adversarial Network for High Fidelity 3D Face Reconstruction [J].
Gecer, Baris ;
Ploumpis, Stylianos ;
Kotsia, Irene ;
Zafeiriou, Stefanos .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) :4879-4893
[10]  
Gidaris P., 2018, INT C LEARNREPRESENT