Action recognition using lie algebrized gaussians over dense local spatio-temporal features

被引:5
|
作者
Chen, Meng [1 ]
Gong, Liyu [2 ]
Wang, Tianjiang [1 ]
Feng, Qi [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
[2] Eedoo Inc, Beijing 100085, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Dense sampling; Local spatio-temporal feature; Gaussian mixture model; Lie algebrized gaussians;
D O I
10.1007/s11042-013-1746-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel framework for human action recognition based on a newly proposed mid-level feature representation method named Lie Algebrized Guassians (LAG). As an action sequence can be treated as a 3D object in space-time space, we address the action recognition problem by recognizing 3D objects and characterize 3D objects by the probability distributions of local spatio-temporal features. First, for each video, we densely sample local spatio-temporal features (e.g. HOG3D) at multiple scales confined in bounding boxes of human body. Moreover, normalized spatial coordinates are appended to local descriptor in order to capture spatial position information. Then the distribution of local features in each video is modeled by a Gaussian Mixture Model (GMM). To estimate the parameters of video-specific GMMs, a global GMM is trained using all training data and video-specific GMMs are adapted from the global GMM. Then the LAG is adopted to vectorize those video-specific GMMs. Finally, linear SVM is employed for classification. Experimental results on the KTH and UCF Sports dataset show that our method achieves state-of-the-art performance.
引用
收藏
页码:2127 / 2142
页数:16
相关论文
共 50 条
  • [1] Action recognition using lie algebrized gaussians over dense local spatio-temporal features
    Meng Chen
    Liyu Gong
    Tianjiang Wang
    Qi Feng
    Multimedia Tools and Applications, 2015, 74 : 2127 - 2142
  • [2] Modeling spatio-temporal layout with Lie Algebrized Gaussians for action recognition
    Chen, Meng
    Gong, Liyu
    Wang, Tianjiang
    Liu, Fang
    Feng, Qi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (17) : 10335 - 10355
  • [3] Modeling spatio-temporal layout with Lie Algebrized Gaussians for action recognition
    Meng Chen
    Liyu Gong
    Tianjiang Wang
    Fang Liu
    Qi Feng
    Multimedia Tools and Applications, 2016, 75 : 10335 - 10355
  • [4] ACTION RECOGNITION BY ORTHOGONALIZED SUBSPACES OF LOCAL SPATIO-TEMPORAL FEATURES
    Raytchev, Bisser
    Shigenaka, Ryosuke
    Tamaki, Toru
    Kaneda, Kazufumi
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 4387 - 4391
  • [5] Fast Realistic Multi-Action Recognition using Mined Dense Spatio-temporal Features
    Gilbert, Andrew
    Illingworth, John
    Bowden, Richard
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 925 - 931
  • [6] Action recognition via spatio-temporal local features: A comprehensive study
    Zhen, Xiantong
    Shao, Ling
    IMAGE AND VISION COMPUTING, 2016, 50 : 1 - 13
  • [7] Action Recognition via an Improved Local Descriptor for Spatio-temporal Features
    Yang, Kai
    Du, Ji-Xiang
    Zhai, Chuan-Min
    ADVANCED INTELLIGENT COMPUTING, 2011, 6838 : 234 - 241
  • [8] Action Recognition Using Discriminative Spatio-Temporal Neighborhood Features
    Cheng, Shi-Lei
    Yang, Jiang-Feng
    Ma, Zheng
    Xie, Mei
    INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS AND INFORMATION SECURITY (CNIS 2015), 2015, : 166 - 172
  • [9] Action recognition using spatio-temporal regularity based features
    Goodhart, Taylor
    Yan, Pingkun
    Shah, Mubarak
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 745 - 748
  • [10] Scale Invariant Action Recognition Using Compound Features Mined from Dense Spatio-temporal Corners
    Gilbert, Andrew
    Illingworth, John
    Bowden, Richard
    COMPUTER VISION - ECCV 2008, PT I, PROCEEDINGS, 2008, 5302 : 222 - 233