An automatic system for unconstrained video-based face recognition

被引:28
作者
Zheng J. [1 ]
Ranjan R. [1 ]
Chen C.-H. [1 ]
Chen J.-C. [1 ,2 ]
Castillo C.D. [1 ]
Chellappa R. [1 ]
机构
[1] Department of Electrical and Computer Engineering, University of Maryland at College Park, College Park, 20742, MD
[2] Research Center for Information Technology Innovation, Academia Sinica, Taipei
来源
IEEE Transactions on Biometrics, Behavior, and Identity Science | 2020年 / 2卷 / 03期
关键词
Face association; Face tracking; Unconstrained video-based face recognition;
D O I
10.1109/TBIOM.2020.2973504
中图分类号
学科分类号
摘要
Although deep learning approaches have achieved performance surpassing humans for still image-based face recognition, unconstrained video-based face recognition is still a challenging task due to large volume of data to be processed and intra/inter-video variations on pose, illumination, occlusion, scene, blur, video quality, etc. In this work, we consider challenging scenarios for unconstrained video-based face recognition from multiple-shot videos and surveillance videos with low-quality frames. To handle these problems, we propose a robust and efficient system for unconstrained video-based face recognition, which is composed of modules for face/fiducial detection, face association, and face recognition. First, we use multi-scale singleshot face detectors to efficiently localize faces in videos. The detected faces are then grouped through carefully designed face association methods, especially for multi-shot videos. Finally, the faces are recognized by the proposed face matcher based on an unsupervised subspace learning approach and a subspace-tosubspace similarity metric. Extensive experiments on challenging video datasets, such as Multiple Biometric Grand Challenge (MBGC), Face and Ocular Challenge Series (FOCS), IARPA Janus Surveillance Video Benchmark (IJB-S) for low-quality surveillance videos and IARPA JANUS Benchmark B (IJB-B) for multiple-shot videos, demonstrate that the proposed system can accurately detect and associate faces from unconstrained videos and effectively learn robust and discriminative features for recognition. © 2020 IEEE.
引用
收藏
页码:194 / 209
页数:15
相关论文
共 64 条
[1]  
Whitelam C., Et al., Iarpa janus benchmark-B face dataset, Proc. IEEE Conf. Comput. Vis. Pattern Recognit. Workshops (CVPRW), pp. 592-600, (2017)
[2]  
Kalka N.D., Et al., Ijb-s: Iarpa janus surveillance video benchmark, Proc. IEEE 9th Int. Conf. Biometrics Theory Appl. Syst. (BTAS), pp. 1-9, (2018)
[3]  
Phillips P.J., Et al., Overview of the multiple biometrics grand challenge, Advances in Biometrics (Lncs 5558), pp. 705-714, (2009)
[4]  
O'Toole A.J., Et al., A video database of moving faces and people, IEEE Trans. Pattern Anal. Mach. Intell., 27, 5, pp. 812-816, (2005)
[5]  
He K., Zhang X., Ren S., Sun J., Deep Residual Learning for Image Recognition, (2015)
[6]  
Ren S., He K., Girshick R., Sun J., Faster r-cnn: Towards realtime object detection with region proposal networks, Proc. Adv. Neural Inf. Process. Syst., 28, pp. 91-99, (2015)
[7]  
Chen L.-C., Papandreou G., Kokkinos I., Murphy K., Yuille A.L., DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs, IEEE Trans. Pattern Anal. Mach. Intell., 40, 4, pp. 834-848, (2018)
[8]  
Ranjan R., Sankaranarayanan S., Castillo C.D., Chellappa R., An all-in-one convolutional neural network for face analysis, Proc. 12th IEEE Int. Conf. Autom. Face Gesture Recognit. (FG), pp. 17-24, (2017)
[9]  
Najibi M., Samangouei P., Chellappa R., Davis L.S., SSH: Single stage headless face detector, Proc. IEEE Int. Conf. Comput. Vis. (ICCV), pp. 4885-4894, (2017)
[10]  
Taigman Y., Yang M., Ranzato M., Wolf L., DeepFace: Closing the gap to human-level performance in face verification, Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 1701-1708, (2014)