Human action recognition using Local Spatio-Temporal Discriminant Embedding

被引：0

作者：

Jia, Kui ^{[1
]}

Yeung, Dit-Yan ^{[2
]}

机构：

[1] CAS CUHK, Shenzhen Inst Adv Integrat Technol, Shenzhen, Peoples R China

[2] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China

来源：

2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12 | 2008年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human action video sequences can be considered as nonlinear dynamic shape manifolds in the space of image frames. In this paper, we address learning and classifying human actions on embedded low-dimensional manifolds. We propose a novel manifold embedding method, called Local Spatio-Temporal Discriminant Embedding (LSTDE). The discriminating capabilities of the proposed method are two-fold: (1) for local spatial discrimination, LSTDE projects data points (silhouette-based image frames of human action sequences) in a local neighborhood into the embedding space where data points of the same action class are close while those of different classes are far apart; (2) in such a local neighborhood, each data point has an associated short video segment, which forms a local temporal subspace on the embedded manifold. LSTDE finds an optimal embedding which maximizes the principal angles between those temporal subspaces associated with data points of different classes. Benefiting from the joint spatio-temporal discriminant embedding, our method is potentially more powerful for classifying human actions with similar space-time shapes, and is able to perform recognition on a frame-by-frame or short video segment basis. Experimental results demonstrate that our method can accurately recognize human actions, and can improve the recognition performance over some representative manifold embedding methods, especially on highly confusing human action types.

引用

页码：3040 / +

页数：2

共 50 条

[1] Human Action Recognition Using Spatio-temporal Classification
Fang, Chin-Hsien
Chen, Ju-Chin
Tseng, Chien-Chung
Lien, Jenn-Jier James
COMPUTER VISION - ACCV 2009, PT II, 2010, 5995 : 98 - 109
[2] Local Spatio-Temporal Interest Point Detection for Human Action Recognition
Li, Feng
Du, Jixiang
2012 IEEE FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2012, : 579 - 582
[3] Spatio-temporal information for human action recognition
Yao, Li
Liu, Yunjian
Huang, Shihui
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2016,
[4] Spatio-temporal information for human action recognition
Li Yao
Yunjian Liu
Shihui Huang
EURASIP Journal on Image and Video Processing, 2016
[5] Silhouette analysis for human action recognition based on maximum spatio-temporal dissimilarity embedding
Jian Cheng
Haijun Liu
Hongsheng Li
Machine Vision and Applications, 2014, 25 : 1007 - 1018
[6] Silhouette analysis for human action recognition based on maximum spatio-temporal dissimilarity embedding
Cheng, Jian
Liu, Haijun
Li, Hongsheng
MACHINE VISION AND APPLICATIONS, 2014, 25 (04) : 1007 - 1018
[7] VIDEO ACTION RECOGNITION WITH SPATIO-TEMPORAL GRAPH EMBEDDING AND SPLINE MODELING
Yuan, Yin
Zheng, Haomian
Li, Zhu
Zhang, David
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 2422 - 2425
[8] Human Action Recognition using Factorized Spatio-Temporal Convolutional Networks
Sun, Lin
Jia, Kui
Yeung, Dit-Yan
Shi, Bertram E.
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4597 - 4605
[9] Spatio-Temporal Steerable Pyramid for Human Action Recognition
Zhen, Xiantong
Shao, Ling
2013 10TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), 2013,
[10] Spatio-temporal Video Autoencoder for Human Action Recognition
Sousa e Santos, Anderson Carlos
Pedrini, Helio
PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 114 - 123

← 1 2 3 4 5 →