Silhouette-based gesture and action recognition via modeling trajectories on Riemannian shape manifolds

被引:51
作者
Abdelkader, Mohamed F. [1 ,2 ]
Abd-Almageed, Wael [1 ,2 ]
Srivastava, Anuj [3 ]
Chellappa, Rama [1 ,2 ]
机构
[1] Univ Maryland, UMIACS, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[2] Univ Maryland, UMIACS, Ctr Automat Res, College Pk, MD 20742 USA
[3] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
关键词
Gesture recognition; Action recognition; Riemannian manifolds; Shape space; Silhouette-based approaches; HIDDEN MARKOV-MODELS; HUMAN MOVEMENT; MOTION; VIDEO;
D O I
10.1016/j.cviu.2010.10.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the problem of recognizing human gestures from videos using models that are built from the Riemannian geometry of shape spaces. We represent a human gesture as a temporal sequence of human poses, each characterized by a contour of the associated human silhouette. The shape of a contour is viewed as a point on the shape space of closed curves and, hence, each gesture is characterized and modeled as a trajectory on this shape space. We propose two approaches for modeling these trajectories. In the first template-based approach, we use dynamic time warping (DTW) to align the different trajectories using elastic geodesic distances on the shape space. The gesture templates are then calculated by averaging the aligned trajectories. In the second approach, we use a graphical model approach similar to an exemplar-based hidden Markov model, where we cluster the gesture shapes on the shape space, and build non-parametric statistical models to capture the variations within each cluster. We model each gesture as a Markov model of transitions between these clusters. To evaluate the proposed approaches, an extensive set of experiments was performed using two different data sets representing gesture and action recognition applications. The proposed approaches not only are successfully able to represent the shape and dynamics of the different classes for recognition, but are also robust against some errors resulting from segmentation and background subtraction. (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:439 / 455
页数:17
相关论文
共 67 条