Deep learning and face recognition: the state of the art

被引:57
作者
Balaban, Stephen [1 ]
机构
[1] Lambda Labs, Palo Alto, CA 94301 USA
来源
BIOMETRIC AND SURVEILLANCE TECHNOLOGY FOR HUMAN AND ACTIVITY IDENTIFICATION XII | 2015年 / 9457卷
关键词
Deep Learning; Feature Learning; Representation Learning; Facial Recognition; Face verification; Face identification; biometrics; NETWORK;
D O I
10.1117/12.2181526
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Deep Neural Networks (DNNs) have established themselves as a dominant technique in machine learning. DNNs have been top performers on a wide variety of tasks including image classification, speech recognition, and face recognition.(1-3) Convolutional neural networks (CNNs) have been used in nearly all of the top performing methods on the Labeled Faces in the Wild (LFW) dataset.(3-6) In this talk and accompanying paper, I attempt to provide a review and summary of the deep learning techniques used in the state-of-the-art. In addition, I highlight the need for both larger and more challenging public datasets to benchmark these systems. Despite the ability of DNNs and autoencoders to perform unsupervised feature learning, modern facial recognition pipelines still require domain specific engineering in the form of re-alignment. For example, in Facebook's recent Deep Face paper, a 3D "frontalization" step lies at the beginning of the pipeline. This step creates a 3D face model for the incoming image and then uses a series of affine transformations of the fiducial points to "frontalize" the image. This step enables the Deep Face system to use a neural network architecture with locally connected layers without weight sharing as opposed to standard convolutional layers.(6) Deep learning techniques combined with large datasets have allowed research groups to surpass human level performance on the LFW dataset.(3,5) The high accuracy (99.63% for Face Net at the time of publishing) and utilization of outside data (hundreds of millions of images in the case of Google's Face Net) suggest that current face verification benchmarks such as LFW may not be challenging enough, nor provide enough data, for current techniques.(3,5) There exist a variety of organizations with mobile photo sharing applications that would be capable of releasing a very large scale and highly diverse dataset of facial images captured on mobile devices. Such an "Image Net for Face Recognition" would likely receive a warm welcome from researchers and practitioners alike.
引用
收藏
页数:8
相关论文
共 34 条
[1]   Face description with local binary patterns:: Application to face recognition [J].
Ahonen, Timo ;
Hadid, Abdenour ;
Pietikainen, Matti .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) :2037-2041
[2]  
[Anonymous], 2015, arXiv preprint arXiv:150200873
[3]  
[Anonymous], 2014, CoRR
[4]  
[Anonymous], 2014, ARXIV14117964
[5]   Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection [J].
Belhumeur, PN ;
Hespanha, JP ;
Kriegman, DJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) :711-720
[6]  
Bengio Yoshua, 2012, Neural Networks: Tricks of the Trade. Second Edition: LNCS 7700, P437, DOI 10.1007/978-3-642-35289-8_26
[7]  
Bengio Y., 2006, Advances in Neural Information Processing Systems, V19, DOI DOI 10.7551/MITPRESS/7503.003.0024
[8]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[9]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[10]   A LIMITED MEMORY ALGORITHM FOR BOUND CONSTRAINED OPTIMIZATION [J].
BYRD, RH ;
LU, PH ;
NOCEDAL, J ;
ZHU, CY .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1995, 16 (05) :1190-1208