Action recognition from depth sequences using weighted fusion of 2D and 3D auto-correlation of gradients features

Cited by: 35
Authors
Chen, Chen [1]
Zhang, Baochang [2]
Hou, Zhenjie [3]
Jiang, Junjun [4]
Liu, Mengyuan [5]
Yang, Yun [2]
Affiliations
[1] Univ Texas Dallas, Dept Elect Engn, Richardson, TX 75080 USA
[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China
[3] Changzhou Univ, Sch Informat Sci & Engn, Changzhou, Peoples R China
[4] China Univ Geosci, Sch Comp Sci, Wuhan, Peoples R China
[5] Peking Univ, Shenzhen Grad Sch, Engn Lab Intelligent Percept Internet Things ELIP, Shenzhen, Peoples R China
Keywords
Action recognition; Depth data; Depth motion maps; Gradient local autocorrelations; Space-time auto-correlation of gradients; Extreme learning machine; Weighted fusion; HISTOGRAMS;
DOI
10.1007/s11042-016-3284-7
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents a new framework for human action recognition from depth sequences. An effective depth feature representation is developed based on the fusion of 2D and 3D auto-correlation of gradients features. Specifically, depth motion maps (DMMs) are first employed to transform a depth sequence into three images capturing shape and motion cues. A feature extraction method utilizing spatial and orientational auto-correlations of image local gradients is then applied to the DMMs. Space-time auto-correlation of gradients features are also extracted from the depth sequences as complementary features to compensate for the temporal information lost in DMM generation. The two feature sets are used as inputs to two extreme learning machine classifiers, one per feature set, to generate probability outputs. A weighted fusion strategy assigns different weights to the classifier probability outputs associated with the different features, thereby providing more flexibility in the final decision. The proposed method is evaluated on two depth action datasets (MSR Action 3D and MSR Gesture 3D) and achieves state-of-the-art recognition performance (94.87% on MSR Action 3D and 98.50% on MSR Gesture 3D).
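The weighted fusion step described in the abstract can be illustrated with a minimal sketch. It assumes a simple convex combination of two classifiers' class-probability vectors; the abstract does not state the exact fusion rule, so the linear form, the function name `fuse_probabilities`, and the parameter `weight_a` are hypothetical and are not taken from the paper.

```python
import numpy as np


def fuse_probabilities(p_a, p_b, weight_a):
    """Fuse two class-probability vectors with a weighted sum.

    Hypothetical helper: a linear combination is assumed here for
    illustration; the paper's actual fusion rule may differ.
    """
    if not 0.0 <= weight_a <= 1.0:
        raise ValueError("weight_a must lie in [0, 1]")
    p_a = np.asarray(p_a, dtype=float)
    p_b = np.asarray(p_b, dtype=float)
    if p_a.shape != p_b.shape:
        raise ValueError("probability arrays must have the same shape")
    # Convex combination: weight_a for classifier A, the remainder for B.
    fused = weight_a * p_a + (1.0 - weight_a) * p_b
    # The predicted class is the index of the largest fused probability.
    return fused, np.argmax(fused, axis=-1)


if __name__ == "__main__":
    # Made-up probabilities for three classes, for illustration only.
    probs_2d = [0.6, 0.3, 0.1]  # e.g. classifier fed with DMM-based features
    probs_3d = [0.2, 0.7, 0.1]  # e.g. classifier fed with space-time features
    fused, label = fuse_probabilities(probs_2d, probs_3d, weight_a=0.5)
    print(fused, label)
```

Under this assumption, changing `weight_a` shifts the final decision toward one feature set or the other, which is the kind of flexibility the abstract attributes to weighted fusion.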
Pages: 4651-4669
Page count: 19