Orthogonality Loss: Learning Discriminative Representations for Face Recognition

被引:21
作者
Yang, Shanming [1 ]
Deng, Weihong [1 ]
Wang, Mei [1 ]
Du, Junping [2 ]
Hu, Jiani [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Comp Sci & Technol, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Face recognition; Face; Feature extraction; Training; Robustness; Matrix decomposition; Benchmark testing; discriminative representations; orthogonality; inter-class distance; MARGIN SOFTMAX; DEEP;
D O I
10.1109/TCSVT.2020.3021128
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional neural networks have achieved excellent performance on face recognition (FR) by learning the high discriminative features with advanced loss functions. These improved loss functions share the similar idea for maximizing inter-class variance or minimizing intra-class variance. In this article, from a different perspective, we consider enlarging the inter-class variance by directly penalizing weight vectors of last fully connected layer, which represent the center of classes. To the end, we propose Orthogonality loss as an elegant penalty item appends to common classification loss to learn the discriminative representations. The main idea is that in order for weight vectors to be discriminative, it should be as close as possible to be orthogonal to each other in the vector space. More specifically, the optimization objective of Orthogonality loss is the first moment and second moment of cosine similarity of weight vectors. We performed the empirical studies through simulating the long-tail datasets to show the generalization ability of the proposed approach on long-tail distribution datasets. Further, extensive experiments on large-scale face recognition benchmarks including the Labeled Face in the Wild (LFW), the IARPA Janus Benchmark A (IJB-A), IJB-B, IJB-C, MegaFace Challenge 1 (MF1) and MS-Celeb-1M Low-shot Learning demonstrated that Orthogonality loss outperforms strong baselines, which showcases the extensive suitability and effectiveness of Orthogonality loss.
引用
收藏
页码:2301 / 2314
页数:14
相关论文
共 64 条
[31]   IARPA Janus Benchmark - C: Face Dataset and Protocol [J].
Maze, Brianna ;
Adams, Jocelyn ;
Duncan, James A. ;
Kalka, Nathan ;
Miller, Tim ;
Otto, Charles ;
Jain, Anil K. ;
Niggel, W. Tyler ;
Anderson, Janet ;
Cheney, Jordan ;
Grother, Patrick .
2018 INTERNATIONAL CONFERENCE ON BIOMETRICS (ICB), 2018, :158-165
[32]  
Ng HW, 2014, IEEE IMAGE PROC, P343, DOI 10.1109/ICIP.2014.7025068
[33]  
Parkhi O. M., P BRIT MACHINE VISIO, V2015
[34]  
Ranjan R., 2017, ARXIV170309507
[35]  
Sankaranarayanan S., 2016, 2016 IEEE 8th international conference on biometrics theory, applications and systems (BTAS)
[36]  
Schroff F, 2015, PROC CVPR IEEE, P815, DOI 10.1109/CVPR.2015.7298682
[37]   A Double-Deep Spatio-Angular Learning Framework for Light Field-Based Face Recognition [J].
Sepas-Moghaddam, Alireza ;
Haque, Mohammad A. ;
Correia, Paulo Lobato ;
Nasrollahi, Kamal ;
Moeslund, Thomas B. ;
Pereira, Fernando .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (12) :4496-4512
[38]  
Sun Y., 2015, ARXIV PREPRINT ARXIV
[39]  
Sun Y, 2014, ADV NEUR IN, V27
[40]  
Sun Y, 2015, PROC CVPR IEEE, P2892, DOI 10.1109/CVPR.2015.7298907