Two-Stage Multi-Scale Resolution-Adaptive Network for Low-Resolution Face Recognition

被引：4

作者：

Wang, Haihan ^{[1
]}

Wang, Shangfei ^{[1
]}

Fang, Lin ^{[1
]}

机构：

[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China

来源：

PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022年

基金：

中国国家自然科学基金;

关键词：

Low-resolution face recognition; Contrastive learning; Multi-resolution representation; Multi-scale distillation;

D O I：

10.1145/3503161.3548196

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Low-resolution face recognition is challenging due to uncertain input resolutions and the lack of distinguishing details in low-resolution (LR) facial images. Resolution-invariant representations must be learned for optimal performance. Existing methods for this task mainly minimize the distance between the representations of the low-resolution (LR) and corresponding high-resolution (HR) image pairs in a common subspace. However, these works only focus on introducing various distance metrics at the final layer and between HR-LR image pairs. They do not fully utilize the intermediate layers or multi-resolution supervision, yielding only modest performance. In this paper, we propose a novel two-stage multi-scale resolution-adaptive network to learn more robust resolution-invariant representations. In the first stage, the structural patterns and the semantic patterns are distilled from HR images to provide sufficient supervision for LR images. A curriculum learning strategy facilitates the training of HR and LR image matching, smoothly decreasing the resolution of LR images. In the second stage, a multi-resolution contrastive loss is introduced on LR images to enforce intra-class clustering and inter-class separation of the LR representations. By introducing multi-scale supervision and multi-resolution LR representation clustering, our network can produce robust representations despite uncertain input sizes. Experimental results on eight benchmark datasets demonstrate the effectiveness of the proposed method. Code will be released at https://github.com/hhwang98/TMR.

引用

页码：4053 / 4062

页数：10

共 30 条

[1]

[Anonymous], 2018, AS C COMP VIS

[2]

Cheng Z., 2018, Technical Report

[3] ArcFace: Additive Angular Margin Loss for Deep Face Recognition [J].

Deng, Jiankang ;

Guo, Jia ;

Xue, Niannan ;

Zafeiriou, Stefanos .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4685-4694

[4] Generate to Adapt: Resolution Adaption Network for Surveillance Face Recognition [J].

Fang, Han ;

Deng, Weihong ;

Zhong, Yaoyao ;

Hu, Jiani .

COMPUTER VISION - ECCV 2020, PT XV, 2020, 12360 :741-758

[5] SCface - surveillance cameras face database [J].

Grgic, Mislav ;

Delac, Kresimir ;

Grgic, Sonja .

MULTIMEDIA TOOLS AND APPLICATIONS, 2011, 51 (03) :863-879

[6] Momentum Contrast for Unsupervised Visual Representation Learning [J].

He, Kaiming ;

Fan, Haoqi ;

Wu, Yuxin ;

Xie, Saining ;

Girshick, Ross .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9726-9735

[7] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[8]

Hjelm R.D., 2018, Learning deep representations by mutual information estimation and maximization

[9]

Huang GaryB., 2008, WORKSHOP FACES INREA

[10] Improving Face Recognition from Hard Samples via Distribution Distillation Loss [J].

Huang, Yuge ;

Shen, Pengcheng ;

Tai, Ying ;

Li, Shaoxin ;

Liu, Xiaoming ;

Li, Jilin ;

Huang, Feiyue ;

Ji, Rongrong .

COMPUTER VISION - ECCV 2020, PT XXX, 2020, 12375 :138-154

← 1 2 3 →