D2C: Deep cumulatively and comparatively learning for human age estimation

被引:46
作者
Li, Kai [1 ]
Xing, Junliang [1 ]
Hu, Weiming [1 ,2 ]
Maybank, Stephen J. [3 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Chinese Acad Sci, Inst Automat, CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing 100190, Peoples R China
[3] Birkbeck Coll, Dept Comp Sci & Informat Syst, London WC1E 7HX, England
关键词
Age estimation; Deep learning; Convolutional neural network; DIMENSIONALITY;
D O I
10.1016/j.patcog.2017.01.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Age estimation from face images is an important yet difficult task in computer vision. Its main difficulty lies in how to design aging features that remain discriminative in spite of large facial appearance variations. Meanwhile, due to the difficulty of collecting and labeling datasets that contain sufficient samples for all possible ages, the age distributions of most benchmark datasets are often imbalanced, which makes this problem more challenge. In this work, we try to solve these difficulties by means of the mainstream deep learning techniques. Specifically, we use a convolutional neural network which can learn discriminative aging features from raw face images without any handcrafting. To combat the sample imbalance problem, we propose a novel cumulative hidden layer which is supervised by a point-wise cumulative signal. With this cumulative hidden layer, our model is learnt indirectly using faces with neighbouring ages and thus alleviate the sample imbalance problem. In order to learn more effective aging features, we further propose a comparative ranking layer which is supervised by a pair-wise comparative signal. This comparative ranking layer facilitates aging feature learning and improves the performance of the main age estimation task. In addition, since one face can be included in many different training pairs, we can make full use of the limited training data. It is noted that both of these two novel layers are differentiable, so our model is end-to-end trainable. Extensive experiments on the two of the largest benchmark datasets show that our deep age estimation model gains notable advantage on accuracy when compared against existing methods.
引用
收藏
页码:95 / 105
页数:11
相关论文
共 54 条
[1]  
[Anonymous], P INT S COMP INF SCI
[2]  
[Anonymous], 2012, THESIS
[3]  
[Anonymous], ABS14091556 CORR
[4]  
[Anonymous], 2014, P AS C COMP VIS
[5]  
[Anonymous], CVPR
[6]  
[Anonymous], 2015, IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2015.123
[7]  
[Anonymous], 2007, P IEEE 11 INT C COMP
[8]  
[Anonymous], 2016, ACM INT C MULT
[9]  
Bengio Y., 2009, P 26 ANN INT C MACHI, P41, DOI DOI 10.1145/1553374.1553380
[10]  
Burges C., 2005, P INT C MACH LEARN I, P5