Mining Actionlet Ensemble for Action Recognition with Depth Cameras

被引：1093

作者：

Wang, Jiang ^{[1
]}

Liu, Zicheng ^{[2
]}

Wu, Ying ^{[1
]}

Yuan, Junsong ^{[3
]}

机构：

[1] Northwestern Univ, Evanston, IL 60208 USA

[2] Microsoft Res, Evanston, IL 60208 USA

[3] Nanyang Technol Univ, Singapore 639798, Singapore

来源：

2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2012年

基金：

美国国家科学基金会;

关键词：

D O I：

10.1109/CVPR.2012.6247813

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human action recognition is an important yet challenging task. The recently developed commodity depth sensors open up new possibilities of dealing with this problem but also present some unique challenges. The depth maps captured by the depth cameras are very noisy and the 3D positions of the tracked joints may be completely wrong if serious occlusions occur, which increases the intra-class variations in the actions. In this paper, an actionlet ensemble model is learnt to represent each action and to capture the intra-class variance. In addition, novel features that are suitable for depth data are proposed. They are robust to noise, invariant to translational and temporal misalignments, and capable of characterizing both the human motion and the human-object interactions. The proposed approach is evaluated on two challenging action recognition datasets captured by commodity depth cameras, and another dataset captured by a MoCap system. The experimental evaluations show that the proposed approach achieves superior performance to the state of the art algorithms.

引用

页码：1290 / 1297

页数：8

共 24 条

[1]

Agrawal R., P 20 INT C VERY LARG

[2]

[Anonymous], 2010, P CVPR

[3]

[Anonymous], 2011, CVPR

[4]

[Anonymous], CMU GRAPH LAB MOT CA

[5]

[Anonymous], 2006, 4 ACM INT WORKSH VID, DOI DOI 10.1145/1178782.1178808

[6]

[Anonymous], ICCV

[7]

[Anonymous], 1999, Prentice-Hall Signal Processing Series

[8] Choosing multiple parameters for support vector machines [J].

Chapelle, O ;

Vapnik, V ;

Bousquet, O ;

Mukherjee, S .

MACHINE LEARNING, 2002, 46 (1-3) :131-159

[9]

Dai S., 2007, Computer Vision and Pattern Recognition, P1

[10] Histograms of oriented gradients for human detection [J].

Dalal, N ;

Triggs, B .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893

← 1 2 3 →