An automatic system for unconstrained video-based face recognition

Cited by: 28

Authors:

Zheng J. [1]

Ranjan R. [1]

Chen C.-H. [1]

Chen J.-C. [1,2]

Castillo C.D. [1]

Chellappa R. [1]

Affiliations:

[1] Department of Electrical and Computer Engineering, University of Maryland at College Park, College Park, MD 20742

[2] Research Center for Information Technology Innovation, Academia Sinica, Taipei

Source:

IEEE Transactions on Biometrics, Behavior, and Identity Science | 2020 / Vol. 2 / No. 3

Keywords:

Face association; Face tracking; Unconstrained video-based face recognition

DOI:

10.1109/TBIOM.2020.2973504

Abstract:

Although deep learning approaches have achieved performance surpassing humans for still image-based face recognition, unconstrained video-based face recognition is still a challenging task due to the large volume of data to be processed and intra/inter-video variations in pose, illumination, occlusion, scene, blur, video quality, etc. In this work, we consider challenging scenarios for unconstrained video-based face recognition from multiple-shot videos and surveillance videos with low-quality frames. To handle these problems, we propose a robust and efficient system for unconstrained video-based face recognition, which is composed of modules for face/fiducial detection, face association, and face recognition. First, we use multi-scale single-shot face detectors to efficiently localize faces in videos. The detected faces are then grouped through carefully designed face association methods, especially for multiple-shot videos. Finally, the faces are recognized by the proposed face matcher based on an unsupervised subspace learning approach and a subspace-to-subspace similarity metric. Extensive experiments on challenging video datasets, such as Multiple Biometric Grand Challenge (MBGC), Face and Ocular Challenge Series (FOCS), IARPA Janus Surveillance Video Benchmark (IJB-S) for low-quality surveillance videos, and IARPA Janus Benchmark-B (IJB-B) for multiple-shot videos, demonstrate that the proposed system can accurately detect and associate faces from unconstrained videos and effectively learn robust and discriminative features for recognition. © 2020 IEEE.

Pages: 194-209

Page count: 15
