Orthogonality Loss: Learning Discriminative Representations for Face Recognition

被引:21
作者
Yang, Shanming [1 ]
Deng, Weihong [1 ]
Wang, Mei [1 ]
Du, Junping [2 ]
Hu, Jiani [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Comp Sci & Technol, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Face recognition; Face; Feature extraction; Training; Robustness; Matrix decomposition; Benchmark testing; discriminative representations; orthogonality; inter-class distance; MARGIN SOFTMAX; DEEP;
D O I
10.1109/TCSVT.2020.3021128
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional neural networks have achieved excellent performance on face recognition (FR) by learning the high discriminative features with advanced loss functions. These improved loss functions share the similar idea for maximizing inter-class variance or minimizing intra-class variance. In this article, from a different perspective, we consider enlarging the inter-class variance by directly penalizing weight vectors of last fully connected layer, which represent the center of classes. To the end, we propose Orthogonality loss as an elegant penalty item appends to common classification loss to learn the discriminative representations. The main idea is that in order for weight vectors to be discriminative, it should be as close as possible to be orthogonal to each other in the vector space. More specifically, the optimization objective of Orthogonality loss is the first moment and second moment of cosine similarity of weight vectors. We performed the empirical studies through simulating the long-tail datasets to show the generalization ability of the proposed approach on long-tail distribution datasets. Further, extensive experiments on large-scale face recognition benchmarks including the Labeled Face in the Wild (LFW), the IARPA Janus Benchmark A (IJB-A), IJB-B, IJB-C, MegaFace Challenge 1 (MF1) and MS-Celeb-1M Low-shot Learning demonstrated that Orthogonality loss outperforms strong baselines, which showcases the extensive suitability and effectiveness of Orthogonality loss.
引用
收藏
页码:2301 / 2314
页数:14
相关论文
共 64 条
[1]  
[Anonymous], 2014, Comput. Sci.
[2]  
Cai T, 2013, J MACH LEARN RES, V14, P1837
[3]  
Chen JC, 2016, IEEE WINT CONF APPL
[4]   Know You at One Glance: A Compact Vector Representation for Low-Shot Learning [J].
Cheng, Yu ;
Zhao, Jian ;
Wang, Zhecan ;
Xu, Yan ;
Jayashree, Karlekar ;
Shen, Shengmei ;
Feng, Jiashi .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :1924-1932
[5]   Learning a similarity metric discriminatively, with application to face verification [J].
Chopra, S ;
Hadsell, R ;
LeCun, Y .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :539-546
[6]  
Cui JY, 2007, CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1 AND 2, P367
[7]   ArcFace: Additive Angular Margin Loss for Deep Face Recognition [J].
Deng, Jiankang ;
Guo, Jia ;
Xue, Niannan ;
Zafeiriou, Stefanos .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4685-4694
[8]   Robust Discriminative Metric Learning for Image Representation [J].
Ding, Zhengming ;
Shao, Ming ;
Hwang, Wonjun ;
Suh, Sungjoov ;
Han, Jae-Joon ;
Choi, Changkyu ;
Fu, Yun .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (11) :3173-3183
[9]   Age Factor Removal Network Based on Transfer Learning and Adversarial Learning for Cross-Age Face Recognition [J].
Du, Lingshuang ;
Hu, Haifeng ;
Wu, Yongbo .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (09) :2830-2842
[10]   Encouraging orthogonality between weight vectors in pretrained deep neural networks [J].
Grzegorczyk, Karol ;
Kurdziel, Martin ;
Wojcik, Piotr Iwo .
NEUROCOMPUTING, 2016, 202 :84-90