Dynamic Texture Comparison Using Derivative Sparse Representation: Application to Video-Based Face Recognition

被引:21
作者
Hajati, Farshid [1 ]
Tavakolian, Mohammad [1 ]
Gheisari, Soheila [2 ]
Gao, Yongsheng [3 ]
Mian, Ajmal S. [4 ]
机构
[1] Tafresh Univ, Dept Elect Engn, Tafresh 3951879611, Iran
[2] Univ Technol Sydney, Ctr Quantum Computat & Intelligent Syst, Fac Engn & Informat Technol, Sydney, NSW 2007, Australia
[3] Griffith Univ, Sch Engn, Nathan, Qld 4111, Australia
[4] Univ Western Australia, Sch Comp Sci & Software Engn, Crawley, WA 6009, Australia
基金
澳大利亚研究理事会;
关键词
Directional derivative; dynamic texture; Grass-mann manifold; high-order pattern; sparse representation; spatiotemporal; LOCAL BINARY PATTERNS; TEMPORAL TEXTURE; ALGORITHMS; CLASSIFICATION; APPROXIMATION;
D O I
10.1109/THMS.2017.2681425
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video-based face, expression, and scene recognition are fundamental problems in human-machine interaction, especially when there is a short-length video. In this paper, we present a new derivative sparse representation approach for face and texture recognition using short-length videos. First, it builds local linear subspaces of dynamic texture segments by computing spatiotemporal directional derivatives in a cylinder neighborhood within dynamic textures. Unlike traditional methods, a nonbinary texture coding technique is proposed to extract high-order derivatives using continuous circular and cylinder regions to avoid aliasing effects. Then, these local linear subspaces of texture segments are mapped onto a Grassmann manifold via sparse representation. A new joint sparse representation algorithm is developed to establish the correspondences of subspace points on the manifold for measuring the similarity between two dynamic textures. Extensive experiments on the Honda/UCSD, the CMU motion of body, the YouTube, and the DynTex datasets show that the proposed method consistently outperforms the state-of-the-art methods in dynamic texture recognition, and achieved the encouraging highest accuracy reported to date on the challenging YouTube face dataset. The encouraging experimental results show the effectiveness of the proposed method in video-based face recognition in human-machine system applications.
引用
收藏
页码:970 / 982
页数:13
相关论文
共 61 条
[1]   Face description with local binary patterns:: Application to face recognition [J].
Ahonen, Timo ;
Hadid, Abdenour ;
Pietikainen, Matti .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) :2037-2041
[2]  
Amari S., 2001, Methods of Information Geometry
[3]  
[Anonymous], COMP IMAG VIS
[4]  
[Anonymous], P IEEE C COMP VIS PA
[5]  
[Anonymous], 2001, Cmu Ri Tr 01-18
[6]  
[Anonymous], 2008, PROC WORKSHOP FACES
[7]  
Arandjelovic O, 2005, PROC CVPR IEEE, P581
[8]   On Recognizing Faces in Videos Using Clustering-Based Re-Ranking and Fusion [J].
Bhatt, Himanshu S. ;
Singh, Richa ;
Vatsa, Mayank .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2014, 9 (07) :1056-1068
[9]   Face Recognition Based on Image Sets [J].
Cevikalp, Hakan ;
Triggs, Bill .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :2567-2573
[10]  
Chan A. B., 2007, PROC IEEE COMPUT SOC, P1