Actions as space-time shapes

Cited by: 987
Authors
Gorelick, Lena
Blank, Moshe
Shechtman, Eli
Irani, Michal
Basri, Ronen
Affiliations
[1] Weizmann Inst Sci, Dept Comp Sci & Appl Math, IL-76100 Rehovot, Israel
[2] EyeClick Ltd, IL-52521 Ramat Gan, Israel
Funding
Israel Science Foundation
Keywords
action representation; action recognition; space-time analysis; shape analysis; Poisson equation
DOI
10.1109/TPAMI.2007.70711
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Human action in video sequences can be seen as silhouettes of a moving torso and protruding limbs undergoing articulated motion. We regard human actions as three-dimensional shapes induced by the silhouettes in the space-time volume. We adopt a recent approach [14] for analyzing 2D shapes and generalize it to deal with volumetric space-time action shapes. Our method utilizes properties of the solution to the Poisson equation to extract space-time features such as local space-time saliency, action dynamics, shape structure, and orientation. We show that these features are useful for action recognition, detection, and clustering. The method is fast, does not require video alignment, and is applicable in (but not limited to) many scenarios where the background is known. Moreover, we demonstrate the robustness of our method to partial occlusions, nonrigid deformations, significant changes in scale and viewpoint, high irregularities in the performance of an action, and low-quality video.
Pages: 2247-2253
Number of pages: 7
Related papers
32 in total
[1] [Anonymous], CVPR 01
[2] [Anonymous], 2000, P 12 C FRANC AFRIF A
[3] [Anonymous], P COMP VIS PATT REC
[4] Belongie, S; Malik, J; Puzicha, J. Shape matching and object recognition using shape contexts [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24(04): 509-522
[5] Besl, PJ; Jain, RC. Invariant surface characteristics for 3D object recognition in range images [J]. COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1986, 33(01): 33-80
[6] Black MJ, 1999, CVPR, V1, P1326
[7] Blank M, 2005, IEEE I CONF COMP VIS, P1395
[8] Blum H., 1967, Models for the Perception of Speech and Visual Forms, P362, DOI 10.1142/S0218654308001154
[9] Bobick, AF; Davis, JW. The recognition of human movement using temporal templates [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23(03): 257-267
[10] Carlsson S, 1999, P INT WORKSH SHAP CO, P1681