ArcFace: Additive Angular Margin Loss for Deep Face Recognition

被引:169
作者
Deng, Jiankang [1 ]
Guo, Jia [2 ]
Yang, Jing [3 ]
Xue, Niannan [1 ]
Kotsia, Irene [4 ]
Zafeiriou, Stefanos [1 ]
机构
[1] Imperial Coll London, Dept Comp, London SW7 2BX, England
[2] InsightFace, London SW7 2AZ, England
[3] Univ Nottingham, Dept Comp Sci, Nottingham NG7 2RD, England
[4] Cogitat, London W10 5YU, England
基金
英国工程与自然科学研究理事会;
关键词
Large-scale face recognition; additive angular margin; noisy labels; sub-class; model inversion;
D O I
10.1109/TPAMI.2021.3087709
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, a popular line of research in face recognition is adopting margins in the well-established softmax loss function to maximize class separability. In this paper, we first introduce an Additive Angular Margin Loss (ArcFace), which not only has a clear geometric interpretation but also significantly enhances the discriminative power. Since ArcFace is susceptible to the massive label noise, we further propose sub-center ArcFace, in which each class contains K sub-centers and training samples only need to be close to any of the K positive sub-centers. Sub-center ArcFace encourages one dominant sub-class that contains the majority of clean faces and non-dominant sub-classes that include hard or noisy faces. Based on this self-propelled isolation, we boost the performance through automatically purifying raw web faces under massive real-world noise. Besides discriminative feature embedding, we also explore the inverse problem, mapping feature vectors to face images. Without training any additional generator or discriminator, the pre-trained ArcFace model can generate identity-preserved face images for both subjects inside and outside the training data only by using the network gradient and Batch Normalization (BN) priors. Extensive experiments demonstrate that ArcFace can enhance the discriminative feature embedding as well as strengthen the generative face synthesis.
引用
收藏
页码:5962 / 5979
页数:18
相关论文
共 121 条
  • [101] Xie W.., 2018, PROC BRIT MACH VIS C, P1332
  • [102] Comparator Networks
    Xie, Weidi
    Shen, Li
    Zisserman, Andrew
    [J]. COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 : 811 - 826
  • [103] Yi D, 2014, ARXIV
  • [104] Yin HX, 2020, PROC CVPR IEEE, P8712, DOI 10.1109/CVPR42600.2020.00874
  • [105] Feature Transfer Learning for Face Recognition with Under-Represented Data
    Yin, Xi
    Yu, Xiang
    Sohn, Kihyuk
    Liu, Xiaoming
    Chandraker, Manmohan
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5697 - 5706
  • [106] Yonghyun Kim, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12354), P536, DOI 10.1007/978-3-030-58545-7_31
  • [107] Yosinski J., 2015, ARXIV
  • [108] Zhang Q., 2019, ARXIV190705653
  • [109] AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations
    Zhang, Xiao
    Zhao, Rui
    Qiao, Yu
    Wang, Xiaogang
    Li, Hongsheng
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10815 - 10824
  • [110] Range Loss for Deep Face Recognition with Long-Tailed Training Data
    Zhang, Xiao
    Fang, Zhiyuan
    Wen, Yandong
    Li, Zhifeng
    Qiao, Yu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5419 - 5428