Frankenstein: Learning Deep Face Representations Using Small Data

被引：78

作者：

Hu, Guosheng ^{[1
]}

Peng, Xiaojiang ^{[2
]}

Yang, Yongxin ^{[3
]}

Hospedales, Timothy M. ^{[4
]}

Verbeek, Jakob ^{[1
]}

机构：

[1] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP,LJK, F-38000 Grenoble, France

[2] Hengyang Normal Univ, Hengyang 421008, Peoples R China

[3] Queen Mary Univ London, Elect Engn & Comp Sci, London E1 4NS, England

[4] Univ Edinburgh, Edinburgh EH8 9JS, Midlothian, Scotland

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2018年 / 27卷 / 01期

关键词：

Face recognition; deep learning; small training data; RECOGNITION;

D O I：

10.1109/TIP.2017.2756450

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep convolutional neural networks have recently proven extremely effective for difficult face recognition problems in uncontrolled settings. To train such networks, very large training sets are needed with millions of labeled images. For some applications, such as near-infrared (NIR) face recognition, such large training data sets are not publicly available and difficult to collect. In this paper, we propose a method to generate very large training data sets of synthetic images by compositing real face images in a given data set. We show that this method enables to learn models from as few as 10 000 training images, which perform on par with models trained from 500 000 images. Using our approach, we also obtain state-of-the-art results on the CASIA NIR-VIS2.0 heterogeneous face recognition data set.

引用

页码：293 / 303

页数：11

共 68 条

[51] The CMU pose, illumination, and expression (PIE) database [J].

Sim, T ;

Baker, S ;

Bsat, M .

FIFTH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, PROCEEDINGS, 2002, :53-58

[52] Fisher Vector Faces in the Wild [J].

Simonyan, Karen ;

Parkhi, Omkar M. ;

Vedaldi, Andrea ;

Zisserman, Andrew .

PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,

[53] Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views [J].

Su, Hao ;

Qi, Charles R. ;

Li, Yangyan ;

Guibas, Leonidas J. .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2686-2694

[54] ACTIVE: Activity Concept Transitions in Video Event Classification [J].

Sun, Chen ;

Nevatia, Ram .

2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :913-920

[55]

Sun Y., 2014, Deeply learned face representations are sparse, selective, and robust

[56]

Sun Y, 2014, ADV NEUR IN, V27

[57] Hybrid Deep Learning for Face Verification [J].

Sun, Yi ;

Wang, Xiaogang ;

Tang, Xiaoou .

2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1489-1496

[58] Deep Learning Face Representation from Predicting 10,000 Classes [J].

Sun, Yi ;

Wang, Xiaogang ;

Tang, Xiaoou .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1891-1898

[59]

Szegedy Christian, 2015, P IEEE C COMP VIS PA, P1, DOI [10.1109/cvpr.2015.7298594, DOI 10.1109/CVPR.2015.7298594]

[60] DeepFace: Closing the Gap to Human-Level Performance in Face Verification [J].

Taigman, Yaniv ;

Yang, Ming ;

Ranzato, Marc'Aurelio ;

Wolf, Lior .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1701-1708

← 1 2 3 4 5 6 7 →