Distance metric optimization driven convolutional neural network for age invariant face recognition

被引:151
作者
Li, Ya [1 ]
Wang, Guangrun [2 ]
Nie, Lin [2 ]
Wang, Qing [2 ]
Tan, Wenwei [3 ]
机构
[1] Guangzhou Univ, Higher Educ Mega Ctr, Guangzhou 510006, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, Higher Educ Mega Ctr, Guangzhou 510006, Guangdong, Peoples R China
[3] Hisilicon Technol Co Ltd, Shenzhen 518129, Peoples R China
关键词
Age invariant; Face recognition; Distance metric; Deep CNN; Joint learning;
D O I
10.1016/j.patcog.2017.10.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the great advances in face-related works in recent years, face recognition across age remains a challenging problem. The traditional approaches to this problem usually include two basic steps: feature extraction and the application of a distance metric, sometimes common space projection is also involved. On the one hand, handling these steps separately ignores the interactions of these components, and on the other hand, the fixed-distance threshold of measurement affects the model's robustness. In this paper, we present a novel distance metric optimization driven learning approach that integrates these traditional steps via a deep convolutional neural network, which learns feature representations and the decision function in an end-to-end way. Given the labelled training images, we first generate a large number of pairs with a certain proportion of matched and unmatched pairs. For matched pairs, we try to select as many different age instances as possible for each person to learn the identification information that is not affected by age. Then, taking these pairs as input, we aim to enlarge the differences between the unmatched pairs while reducing the variations between the matched pairs, and we update the model parameters by using the mini-batch stochastic gradient descent (SGD) algorithm. Specifically, the distance matrix is used as the top fully connected layer, and the bottom layers representing the image features are integrated with it seamlessly. Thus, the image features and the distance metric can be optimized simultaneously by backward propagation. In particular, we introduce several training strategies to reduce the computational cost and overcome insufficient memory capacity. We evaluate our method on three tasks: age-invariant face identification on the MORPH database, age-invariant face retrieval on the CACD database and age-invariant face verification on CACD-VS database. The experimental results demonstrate the effectiveness of our approach. (C) 2017 Published by Elsevier Ltd.
引用
收藏
页码:51 / 62
页数:12
相关论文
共 47 条
  • [1] Ahonen T., IEEE T PATTERN ANAL, P28
  • [2] [Anonymous], 2013, P 10 IEEE INT C WORK, DOI DOI 10.1109/FG.2013.6553724
  • [3] [Anonymous], 2015, PROC CVPR IEEE, DOI 10.1109/CVPR.2015.7299173
  • [4] [Anonymous], 2012, Advances in Neural Information Processing Systems
  • [5] Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection
    Belhumeur, PN
    Hespanha, JP
    Kriegman, DJ
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) : 711 - 720
  • [6] Local descriptors in application to the aging problem in face recognition
    Bereta, Michal
    Karczmarek, Pawel
    Pedrycz, Witold
    Reformat, Marek
    [J]. PATTERN RECOGNITION, 2013, 46 (10) : 2634 - 2646
  • [7] Nonlinear Topological Component Analysis: Application to Age-Invariant Face Recognition
    Bouchaffra, Djamel
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (07) : 1375 - 1387
  • [8] PCANet: A Simple Deep Learning Baseline for Image Classification?
    Chan, Tsung-Han
    Jia, Kui
    Gao, Shenghua
    Lu, Jiwen
    Zeng, Zinan
    Ma, Yi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) : 5017 - 5032
  • [9] Facial age estimation based on label-sensitive learning and age-oriented regression
    Chao, Wei-Lun
    Liu, Jun-Zuo
    Ding, Jian-Jiun
    [J]. PATTERN RECOGNITION, 2013, 46 (03) : 628 - 641
  • [10] Chen B.-C., IEEE T MULTIMEDIA, P17