Expression-robust 3D face recognition via weighted sparse representation of multi-scale and multi-component local normal patterns

被引:50
作者
Li, Huibin [1 ,2 ]
Huang, Di [3 ]
Morvan, Jean-Marie [1 ,4 ,5 ]
Chen, Liming [1 ,2 ]
Wang, Yunhong [3 ]
机构
[1] Univ Lyon, CNRS, Lyon, France
[2] Ecole Cent Lyon, LIRIS UMR5205, F-69134 Lyon, France
[3] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China
[4] Univ Lyon 1, Inst Camille Jordan, F-69622 Villeurbanne, France
[5] King Abdullah Univ Sci & Technol, GMSV Res Ctr, Thuwal 239556900, Saudi Arabia
基金
中国国家自然科学基金;
关键词
Facial surface normal; Local normal patterns; Weighted sparse representation; 3D face recognition; Identical twins; BINARY PATTERNS; IMAGES; CLASSIFICATION; REGISTRATION; CURVES; SCALE; POINT;
D O I
10.1016/j.neucom.2013.11.018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the theory of differential geometry, surface normal, as a first order surface differential quantity, determines the orientation of a surface at each point and contains informative local surface shape information. To fully exploit this kind of information for 3D face recognition (FR), this paper proposes a novel highly discriminative facial shape descriptor, namely multi-scale and multi-component local normal patterns (MSMC-LNP). Given a normalized facial range image, three components of normal vectors are first estimated, leading to three normal component images. Then, each normal component image is encoded locally to local normal patterns (LNP) on different scales. To utilize spatial information of facial shape, each normal component image is divided into several patches, and their LNP histograms are computed and concatenated according to the facial configuration. Finally, each original facial surface is represented by a set of LNP histograms including both global and local cues. Moreover, to make the proposed solution robust to the variations of facial expressions, we propose to learn the weight of each local patch on a given encoding scale and normal component image. Based on the learned weights and the weighted LNP histograms, we formulate a weighted sparse representation-based classifier (W-SRC). In contrast to the overwhelming majority of 3D FR approaches which were only benchmarked on the FRGC v2.0 database, we carried out extensive experiments on the FRGC v2.0, Bosphorus, BU-3DFE and 3D-TEC databases, thus including 3D face data captured in different scenarios through various sensors and depicting in particular different challenges with respect to facial expressions. The experimental results show that the proposed approach consistently achieves competitive rank-one recognition rates on these databases despite their heterogeneous nature, and thereby demonstrates its effectiveness and its generalizability. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:179 / 193
页数:15
相关论文
共 63 条
[1]  
Abate A.F., 2007, P IEEE INT C IM PROC
[2]  
Abate A.F., 2006, P IEEE INT C IM PROC
[3]  
Abate A.F., 2005, P IEEE INT C IM PROC
[4]   Normal maps vs. visible images: Comparing classifiers and combining modalities [J].
Abate, Andrea Francesco ;
De Marsico, Maria ;
Ricciardi, Stefano ;
Riccio, Daniel .
JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2009, 20 (03) :156-168
[5]  
Ahonen T., 2004, P INT C EUR C COMP V
[6]   Integration of local and global geometrical cues for 3D face recognition [J].
Ai-Osaimi, F. R. ;
Bennamoun, M. ;
Mian, A. .
PATTERN RECOGNITION, 2008, 41 (03) :1030-1040
[7]   An Expression Deformation Approach to Non-rigid 3D Face Recognition [J].
Al-Osaimi, F. ;
Bennamoun, M. ;
Mian, A. .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2009, 81 (03) :302-316
[8]   Regional Registration for Expression Resistant 3-D Face Recognition [J].
Alyuz, Nese ;
Gokberk, Berk ;
Akarun, Lale .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2010, 5 (03) :425-440
[9]  
Amor B.B., 2006, 6 INT C INT SYST DES, P150
[10]   Lambertian reflectance and linear subspaces [J].
Basri, R ;
Jacobs, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (02) :218-233