Silhouette-based gesture and action recognition via modeling trajectories on Riemannian shape manifolds

被引：51

作者：

Abdelkader, Mohamed F. ^{[1
,2
]}

Abd-Almageed, Wael ^{[1
,2
]}

Srivastava, Anuj ^{[3
]}

Chellappa, Rama ^{[1
,2
]}

机构：

[1] Univ Maryland, UMIACS, Dept Elect & Comp Engn, College Pk, MD 20742 USA

[2] Univ Maryland, UMIACS, Ctr Automat Res, College Pk, MD 20742 USA

[3] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA

来源：

COMPUTER VISION AND IMAGE UNDERSTANDING | 2011年 / 115卷 / 03期

关键词：

Gesture recognition; Action recognition; Riemannian manifolds; Shape space; Silhouette-based approaches; HIDDEN MARKOV-MODELS; HUMAN MOVEMENT; MOTION; VIDEO;

D O I：

10.1016/j.cviu.2010.10.006

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper addresses the problem of recognizing human gestures from videos using models that are built from the Riemannian geometry of shape spaces. We represent a human gesture as a temporal sequence of human poses, each characterized by a contour of the associated human silhouette. The shape of a contour is viewed as a point on the shape space of closed curves and, hence, each gesture is characterized and modeled as a trajectory on this shape space. We propose two approaches for modeling these trajectories. In the first template-based approach, we use dynamic time warping (DTW) to align the different trajectories using elastic geodesic distances on the shape space. The gesture templates are then calculated by averaging the aligned trajectories. In the second approach, we use a graphical model approach similar to an exemplar-based hidden Markov model, where we cluster the gesture shapes on the shape space, and build non-parametric statistical models to capture the variations within each cluster. We model each gesture as a Markov model of transitions between these clusters. To evaluate the proposed approaches, an extensive set of experiments was performed using two different data sets representing gesture and action recognition applications. The proposed approaches not only are successfully able to represent the shape and dynamics of the different classes for recognition, but are also robust against some errors resulting from segmentation and background subtraction. (C) 2010 Elsevier Inc. All rights reserved.

引用

页码：439 / 455

页数：17

共 67 条

[1]

Aggarwal JK, 2004, 2ND INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING, VISUALIZATION, AND TRANSMISSION, PROCEEDINGS, P640

[2]

[Anonymous], 1998, Statistical shape analysis

[3]

[Anonymous], P IEEE COMPUT SOC C

[4]

[Anonymous], 2006, P IEEE C COMP VIS PA, DOI [10.1109/CVPR.2006.50, DOI 10.1109/CVPR.2006.50]

[5]

[Anonymous], CVPR, DOI DOI 10.1109/CVPR.2008.4587733

[6]

[Anonymous], 1993, General pattern theory

[7] A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].

BAUM, LE ;

PETRIE, T ;

SOULES, G ;

WEISS, N .

ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&

[8] Shape matching and object recognition using shape contexts [J].

Belongie, S ;

Malik, J ;

Puzicha, J .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522

[9]

BHUIYAN MA, 2007, 10 INT C COMP INF TE, P1

[10] The recognition of human movement using temporal templates [J].

Bobick, AF ;

Davis, JW .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (03) :257-267

← 1 2 3 4 5 6 7 →