Action recognition from depth sequences using weighted fusion of 2D and 3D auto-correlation of gradients features

被引：37

作者：

Chen, Chen ^{[1
]}

Zhang, Baochang ^{[2
]}

Hou, Zhenjie ^{[3
]}

Jiang, Junjun ^{[4
]}

Liu, Mengyuan ^{[5
]}

Yang, Yun ^{[2
]}

机构：

[1] Univ Texas Dallas, Dept Elect Engn, Richardson, TX 75080 USA

[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China

[3] Changzhou Univ, Sch Informat Sci & Engn, Changzhou, Peoples R China

[4] China Univ Geosci, Sch Comp Sci, Wuhan, Peoples R China

[5] Peking Univ, Shenzhen Grad Sch, Engn Lab Intelligent Percept Internet Things ELIP, Shenzhen, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2017年 / 76卷 / 03期

关键词：

Action recognition; Depth data; Depth motion maps; Gradient local autocorrelations; Space-time auto-correlation of gradients; Extreme learning machine; Weighted fusion; HISTOGRAMS;

D O I：

10.1007/s11042-016-3284-7

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a new framework for human action recognition from depth sequences. An effective depth feature representation is developed based on the fusion of 2D and 3D auto-correlation of gradients features. Specifically, depth motion maps (DMMs) are first employed to transform a depth sequence into three images capturing shape and motion cues. A feature extraction method utilizing spatial and orientational auto-correlations of image local gradients is introduced to extract features from DMMs. Space-time auto-correlation of gradients features are also extracted from depth sequences as complementary features to cope with the temporal information loss in the DMMs generation. Each set of features is used as input to two extreme learning machine classifiers to generate probability outputs. A weighted fusion strategy is proposed to assign different weights to the classifier probability outputs associated with different features, thereby providing more flexibility in the final decision making. The proposed method is evaluated on two depth action datasets (MSR Action 3D and MSR Gesture 3D) and obtains the state-of-the-art recognition performance (94.87 % for the MSR Action 3D and 98.50 % for the MSR Gesture 3D).

引用

页码：4651 / 4669

页数：19

共 39 条

[1] Human activity recognition from 3D data: A review [J].

Aggarwal, J. K. ;

Xia, Lu .

PATTERN RECOGNITION LETTERS, 2014, 48 :70-80

[2]

[Anonymous], PATTERN RECOGNITION

[3]

[Anonymous], 2013, P 23 INT JOINT C ART

[4]

Chen C, 2015, LAND USE SCENE CLASS

[5] Gradient Local Auto-Correlations and Extreme Learning Machine for Depth-Based Activity Recognition [J].

Chen, Chen ;

Hou, Zhenjie ;

Zhang, Baochang ;

Jiang, Junjun ;

Yang, Yun .

ADVANCES IN VISUAL COMPUTING, PT I (ISVC 2015), 2015, 9474 :613-623

[6] Gabor-Filtering-Based Completed Local Binary Patterns for Land-Use Scene Classification [J].

Chen, Chen ;

Zhou, Libing ;

Guo, Jianzhong ;

Li, Wei ;

Su, Hongjun ;

Guo, Fangda .

2015 1ST IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2015, :324-329

[7] Action Recognition from Depth Sequences Using Depth Motion Maps-based Local Binary Patterns [J].

Chen, Chen ;

Jafari, Roozbeh ;

Kehtarnavaz, Nasser .

2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, :1092-1099

[8] Improving Human Action Recognition Using Fusion of Depth Camera and Inertial Sensors [J].

Chen, Chen ;

Jafari, Roozbeh ;

Kehtarnavaz, Nasser .

IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2015, 45 (01) :51-61

[9]

Chen C, 2014, IEEE ENG MED BIO, P4135, DOI 10.1109/EMBC.2014.6944534

[10]

Chen C, 2014, IEEE ENG MED BIO, P4983, DOI 10.1109/EMBC.2014.6944743

← 1 2 3 4 →