ArcFace: Additive Angular Margin Loss for Deep Face Recognition

被引：169

作者：

Deng, Jiankang ^{[1
]}

Guo, Jia ^{[2
]}

Yang, Jing ^{[3
]}

Xue, Niannan ^{[1
]}

Kotsia, Irene ^{[4
]}

Zafeiriou, Stefanos ^{[1
]}

机构：

[1] Imperial Coll London, Dept Comp, London SW7 2BX, England

[2] InsightFace, London SW7 2AZ, England

[3] Univ Nottingham, Dept Comp Sci, Nottingham NG7 2RD, England

[4] Cogitat, London W10 5YU, England

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2022年 / 44卷 / 10期

基金：

英国工程与自然科学研究理事会;

关键词：

Large-scale face recognition; additive angular margin; noisy labels; sub-class; model inversion;

D O I：

10.1109/TPAMI.2021.3087709

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, a popular line of research in face recognition is adopting margins in the well-established softmax loss function to maximize class separability. In this paper, we first introduce an Additive Angular Margin Loss (ArcFace), which not only has a clear geometric interpretation but also significantly enhances the discriminative power. Since ArcFace is susceptible to the massive label noise, we further propose sub-center ArcFace, in which each class contains K sub-centers and training samples only need to be close to any of the K positive sub-centers. Sub-center ArcFace encourages one dominant sub-class that contains the majority of clean faces and non-dominant sub-classes that include hard or noisy faces. Based on this self-propelled isolation, we boost the performance through automatically purifying raw web faces under massive real-world noise. Besides discriminative feature embedding, we also explore the inverse problem, mapping feature vectors to face images. Without training any additional generator or discriminator, the pre-trained ArcFace model can generate identity-preserved face images for both subjects inside and outside the training data only by using the network gradient and Batch Normalization (BN) priors. Extensive experiments demonstrate that ArcFace can enhance the discriminative feature embedding as well as strengthen the generative face synthesis.

引用

页码：5962 / 5979

页数：18

共 121 条

[101] Xie W.., 2018, PROC BRIT MACH VIS C, P1332
[102] Comparator Networks
Xie, Weidi
Shen, Li
Zisserman, Andrew
[J]. COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 : 811 - 826
[103] Yi D, 2014, ARXIV
[104] Yin HX, 2020, PROC CVPR IEEE, P8712, DOI 10.1109/CVPR42600.2020.00874
[105] Feature Transfer Learning for Face Recognition with Under-Represented Data
Yin, Xi
Yu, Xiang
Sohn, Kihyuk
Liu, Xiaoming
Chandraker, Manmohan
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5697 - 5706
[106] Yonghyun Kim, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12354), P536, DOI 10.1007/978-3-030-58545-7_31
[107] Yosinski J., 2015, ARXIV
[108] Zhang Q., 2019, ARXIV190705653
[109] AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations
Zhang, Xiao
Zhao, Rui
Qiao, Yu
Wang, Xiaogang
Li, Hongsheng
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10815 - 10824
[110] Range Loss for Deep Face Recognition with Long-Tailed Training Data
Zhang, Xiao
Fang, Zhiyuan
Wen, Yandong
Li, Zhifeng
Qiao, Yu
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5419 - 5428

← 4 5 6 7 8 9 10 11 12 13 →