Hierarchical Pyramid Diverse Attention Networks for Face Recognition

被引：54

作者：

Wang, Qiangchang ^{[1
,4
]}

Wu, Tianyi ^{[2
,3
]}

Zheng, He ^{[2
,3
]}

Guo, Guodong ^{[1
,2
,3
]}

机构：

[1] West Virginia Univ, Morgantown, WV 26506 USA

[2] Baidu Res, Inst Deep Learning, Beijing, Peoples R China

[3] Natl Engn Lab Deep Learning Technol & Applicat, Beijing, Peoples R China

[4] Baidu, Beijing, Peoples R China

来源：

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020) | 2020年

关键词：

D O I：

10.1109/CVPR42600.2020.00835

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning has achieved a great success in face recognition (FR), however, few existing models take hierarchical multi-scale local features into consideration. In this work, we propose a hierarchical pyramid diverse attention (HPDA) network. First, it is observed that local patches would play important roles in FR when the global face appearance changes dramatically. Some recent works apply attention modules to locate local patches automatically without relying on face landmarks. Unfortunately, without considering diversity, some learned attentions tend to have redundant responses around some similar local patches, while neglecting other potential discriminative facial parts. Meanwhile, local patches may appear at different scales due to pose variations or large expression changes. To alleviate these challenges, we propose a pyramid diverse attention (PDA) to learn multi-scale diverse local representations automatically and adaptively. More specifically, a pyramid attention is developed to capture multi-scale features. Meanwhile, a diverse learning is developed to encourage models to focus on different local patches and generate diverse local features. Second, almost all existing models focus on extracting features from the last convolutional layer, lacking of local details or small-scale face parts in lower layers. Instead of simple concatenation or addition, we propose to use a hierarchical bilinear pooling (HBP) to fuse information from multiple layers effectively. Thus, the HPDA is developed by integrating the PDA into the HBP. Experimental results on several datasets show the effectiveness of the HPDA, compared to the state-of-the-art methods.

引用

页码：8323 / 8332

页数：10

共 46 条

[1]

[Anonymous], 2015, Proc. IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, DOI DOI 10.1109/CVPR.2015.7298803

[2]

[Anonymous], 2018, CROSS POSE LFW DATA, DOI DOI 10.1145/3267494.3267495

[3] VGGFace2: A dataset for recognising faces across pose and age [J].

Cao, Qiong ;

Shen, Li ;

Xie, Weidi ;

Parkhi, Omkar M. ;

Zisserman, Andrew .

PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :67-74

[4]

Chen BC, 2014, LECT NOTES COMPUT SC, V8694, P768, DOI 10.1007/978-3-319-10599-4_49

[5] ArcFace: Additive Angular Margin Loss for Deep Face Recognition [J].

Deng, Jiankang ;

Guo, Jia ;

Xue, Niannan ;

Zafeiriou, Stefanos .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4685-4694

[6] Trunk-Branch Ensemble Convolutional Neural Networks for Video-Based Face Recognition [J].

Ding, Changxing ;

Tao, Dacheng .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :1002-1014

[7]

Ding Guodong, 2017, ARXIV

[8] Dual Attention Network for Scene Segmentation [J].

Fu, Jun ;

Liu, Jing ;

Tian, Haijie ;

Li, Yong ;

Bao, Yongjun ;

Fang, Zhiwei ;

Lu, Hanqing .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3141-3149

[9] PCCF: Periodic and continual temporal co-factorization for recommender systems [J].

Guo, Guibing ;

Zhu, Feida ;

Qu, Shilin ;

Wang, Xingwei .

INFORMATION SCIENCES, 2018, 436 :56-73

[10]

Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]

← 1 2 3 4 5 →