Human activity recognition by separating style and content

Cited by: 9
Authors
Cheema, Muhammad Shahzad [1]
Eweiwi, Abdalrahman [1]
Bauckhage, Christian [1, 2]
Affiliations
[1] Univ Bonn, Bonn Aachen Int Ctr IT, D-53113 Bonn, Germany
[2] Fraunhofer Inst Intelligent Anal & Informat Syst, Sankt Augustin, Germany
Keywords
Action recognition; Bilinear models; Kinect depth images; Motion history volumes; Motion capture; Expectation maximization
DOI
10.1016/j.patrec.2013.09.024
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Studies in psychophysics suggest that people tend to perform different actions in their own style. This article deals with the problem of recognizing human actions and the underlying execution styles (actors) in videos. We present a hierarchical approach that is based on conventional action recognition and asymmetrical bilinear modeling. In particular, we employ bilinear factorization on the tensorial representation of the action videos to characterize styles of performing different actions. Our approach is solely based on the dynamics of the underlying activity. The model is evaluated on the IXMAS and the Berkeley-MHAD data sets using different modalities based on optical motion capture, Kinect depth videos, and 3D motion history volumes. In each case high recognition accuracy is achieved in comparison to the symmetric bilinear modeling and the Nearest Neighbor classification. (C) 2013 Elsevier B.V. All rights reserved.
Pages: 130-138
Number of pages: 9