Robust human activity recognition from depth video using spatiotemporal multi-fused features

被引：258

作者：

Jalal, Ahmad ^{[1
]}

Kim, Yeon-Ho ^{[1
]}

Kim, Yong-Joong ^{[1
]}

Kamal, Shaharyar ^{[2
]}

Kim, Daijin ^{[1
]}

机构：

[1] POSTECH, Dept Comp Sci & Engn, San 31, Pohang 790784, South Korea

[2] Kyung Hee Univ, Dept Elect & Radio Engn, Yongin 446701, South Korea

来源：

PATTERN RECOGNITION | 2017年 / 61卷

关键词：

Human activity recognition; Depth silhouette; Skeleton joint extraction; Spatiotemporal multi-fused feature extraction; Hidden Markov model; Forward spotting scheme; ALGORITHM; SYSTEM; POSE;

D O I：

10.1016/j.patcog.2016.08.003

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The recently developed depth imaging technologies have provided new directions for human activity recognition (HAR) without attaching optical markers or any other motion sensors to human body parts. In this paper, we propose novel multi-fused features for online human activity recognition (HAR) system that recognizes human activities from continuous sequences of depth map. The proposed online HAR system segments human depth silhouettes using temporal human motion information as well as it obtains human skeleton joints using spatiotemporal human body information. Then, it extracts the spatiotemporal multi-fused features that concatenate four skeleton joint features and one body shape feature. Skeleton joint features include the torso-based distance feature (DT), the key joint-based distance feature (DK), the spatiotemporal magnitude feature (M) and the spatiotemporal directional angle feature (theta). The body shape feature called HOG-DDS represents the projections of the depth differential silhouettes (DDS) between two consecutive frames onto three orthogonal planes by the histogram of oriented gradients (HOG) format. The size of the proposed spatiotemporal multi-fused feature is reduced by a code vector in the code book which is generated by vector quantization method. Then, it trains the hidden Markov model (HMM) with the code vectors of the multi-fused features and recognizes the segmented human activity by the forward spotting scheme using the trained HMM-based human activity classifiers. The experimental results on three challenging depth video datasets such as IM-Daily-DepthActivity, MSRAction3D and MSRDailyActivity3D demonstrate that the proposed online HAR method using the proposed multi-fused features outperforms the state-of-the-art HAR methods in terms of recognition accuracy. (C) 2016 Elsevier Ltd. All rights reserved.

引用

页码：295 / 308

页数：14

共 51 条

[1] Human activity recognition using multi-features and multiple kernel learning [J].

Althloothi, Salah ;

Mahoor, Mohammad H. ;

Zhang, Xiao ;

Voyles, Richard M. .

PATTERN RECOGNITION, 2014, 47 (05) :1800-1812

[2]

[Anonymous], P AS C COMP VIS

[3]

[Anonymous], 2013, P 23 INT JOINT C ART

[4]

[Anonymous], P IEEE INT C IM PROC

[5]

[Anonymous], P INT C INT MULT COM

[6]

[Anonymous], 2012, P ACM INT C MULT NAR, DOI DOI 10.1145/2393347.2396382

[7]

Baak A, 2011, IEEE I CONF COMP VIS, P1092, DOI 10.1109/ICCV.2011.6126356

[8] The recognition of human movement using temporal templates [J].

Bobick, AF ;

Davis, JW .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (03) :257-267

[9] Fuzzy Rule Inference Based Human Activity Recognition [J].

Chang, Jyh-Yeong ;

Shyu, Jia-Jye ;

Cho, Chien-Wen .

2009 IEEE CONTROL APPLICATIONS CCA & INTELLIGENT CONTROL (ISIC), VOLS 1-3, 2009, :211-215

[10]

Cheng ZW, 2012, LECT NOTES COMPUT SC, V7584, P52, DOI 10.1007/978-3-642-33868-7_6

← 1 2 3 4 5 6 →