Multiple Network Fusion with Low-Rank Representation for Image-Based Age Estimation

被引:0
作者
Hong, Chaoqun [1 ,2 ]
Zeng, Zhiqiang [1 ]
Wang, Xiaodong [1 ]
Zhuang, Weiwei [1 ]
机构
[1] Xiamen Univ Technol, Sch Comp & Informat Engn, Xiamen 361024, Peoples R China
[2] Ligong Rd 600, Xiamen 361024, Fujian, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2018年 / 8卷 / 09期
基金
中国国家自然科学基金;
关键词
age estimation; multi-modal features; deep learning; low-rank representation; CLASSIFICATION;
D O I
10.3390/app8091601
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Image-based age estimation is a challenging task since there are ambiguities between the apparent age of face images and the actual ages of people. Therefore, data-driven methods are popular. To improve data utilization and estimation performance, we propose an image-based age estimation method. Theoretically speaking, the key idea of the proposed method is to integrate multi-modal features of face images. In order to achieve it, we propose a multi-modal learning framework, which is called Multiple Network Fusion with Low-Rank Representation (MNF-LRR). In this process, different deep neural network (DNN) structures, such as autoencoders, Convolutional Neural Networks (CNNs), Recursive Neural Networks (RNNs), and so on, can be used to extract semantic information of facial images. The outputs of these neural networks are then represented in a low-rank feature space. In this way, feature fusion is obtained in this space, and robust multi-modal image features can be computed. An experimental evaluation is conducted on two challenging face datasets for image-based age estimation extracted from the Internet Move Database (IMDB) and Wikipedia (WIKI). The results show the effectiveness of the proposed MNF-LRR.
引用
收藏
页数:13
相关论文
共 36 条
[1]   Recovering 3D human pose from monocular images [J].
Agarwal, A ;
Triggs, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (01) :44-58
[2]   Face description with local binary patterns:: Application to face recognition [J].
Ahonen, Timo ;
Hadid, Abdenour ;
Pietikainen, Matti .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) :2037-2041
[3]  
[Anonymous], 2014, INT C MACH LEARN
[4]  
[Anonymous], 2012, PREDICTION CANDIDATE
[5]  
[Anonymous], 2013, ICB, DOI DOI 10.1109/ICB.2013.6613022
[6]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[7]  
Bo L., 2010, TWIN GAUSSIAN PROCES, P28
[8]  
Chen BC, 2014, LECT NOTES COMPUT SC, V8694, P768, DOI 10.1007/978-3-319-10599-4_49
[9]  
Cootes T. F., 1998, Computer Vision - ECCV'98. 5th European Conference on Computer Vision. Proceedings, P484, DOI 10.1007/BFb0054760
[10]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893