Frankenstein: Learning Deep Face Representations Using Small Data

Cited by: 78
Authors
Hu, Guosheng [1]
Peng, Xiaojiang [2]
Yang, Yongxin [3]
Hospedales, Timothy M. [4]
Verbeek, Jakob [1]
Affiliations
[1] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP, LJK, F-38000 Grenoble, France
[2] Hengyang Normal Univ, Hengyang 421008, Peoples R China
[3] Queen Mary Univ London, Elect Engn & Comp Sci, London E1 4NS, England
[4] Univ Edinburgh, Edinburgh EH8 9JS, Midlothian, Scotland
Keywords
Face recognition; deep learning; small training data; RECOGNITION
DOI
10.1109/TIP.2017.2756450
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Deep convolutional neural networks have recently proven extremely effective for difficult face recognition problems in uncontrolled settings. Training such networks requires very large training sets with millions of labeled images. For some applications, such as near-infrared (NIR) face recognition, such large training data sets are not publicly available and are difficult to collect. In this paper, we propose a method to generate very large training data sets of synthetic images by compositing real face images in a given data set. We show that this method makes it possible to learn models from as few as 10 000 training images that perform on par with models trained from 500 000 images. Using our approach, we also obtain state-of-the-art results on the CASIA NIR-VIS2.0 heterogeneous face recognition data set.
Pages: 293-303
Page count: 11