Sparse Coding on Local Spatial-Temporal Volumes for Human Action Recognition

被引:0
|
作者
Zhu, Yan [1 ]
Zhao, Xu [1 ]
Fu, Yun [2 ]
Liu, Yuncai [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[2] SUNY Buffalo, Dept CSE, Buffalo, NY 14260 USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
By extracting local spatial-temporal features from videos, many recently proposed approaches for action recognition achieve promising performance. The Bag-of-Words (BoW) model is commonly used in the approaches to obtain the video level representations. However, BoW model roughly assigns each feature vector to its closest visual word, therefore inevitably causing nontrivial quantization errors and impairing further improvements on classification rates. To obtain a more accurate and discriminative representation, in this paper, we propose an approach for action recognition by encoding local 3D spatial-temporal gradient features within the sparse coding framework. In so doing, each local spatial-temporal feature is transformed to a linear combination of a few "atoms" in a trained dictionary. In addition, we also investigate the construction of the dictionary under the guidance of transfer learning. We collect a large set of diverse video clips of sport games and movies, from which a set of universal atoms composed of the dictionary are learned by an online learning strategy. We test our approach on KTH dataset and UCF sports dataset. Experimental results demonstrate that our approach outperforms the state-of-art techniques on KTH dataset and achieves the comparable performance on UCF sports dataset.
引用
收藏
页码:660 / +
页数:3
相关论文
共 50 条
  • [1] Action Recognition Based on Spatial-Temporal Pyramid Sparse Coding
    Zhang, Xiaojing
    Zhang, Hua
    Cao, Xiaochun
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1455 - 1458
  • [2] Statistics on Temporal Changes of Sparse Coding Coefficients in Spatial Pyramids for Human Action Recognition
    Li, Yang
    Ye, Junyong
    Wang, Tongqing
    Huang, Shijian
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (09) : 1711 - 1714
  • [3] StNet: Local and Global Spatial-Temporal Modeling for Action Recognition
    He, Dongliang
    Zhou, Zhichao
    Gan, Chuang
    Li, Fu
    Liu, Xiao
    Li, Yandong
    Wang, Limin
    Wen, Shilei
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8401 - 8408
  • [4] Spatial-Temporal Attention for Action Recognition
    Sun, Dengdi
    Wu, Hanqing
    Ding, Zhuanlian
    Luo, Bin
    Tang, Jin
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 854 - 864
  • [5] Sparse representation of local spatial-temporal features with dimensionality reduction for motion recognition
    Wang, Jin
    Sun, Xiangping
    Liu, Ping
    She, Mary F. H.
    Kong, Lingxue
    NEUROCOMPUTING, 2013, 115 : 150 - 160
  • [6] Human action recognition in videos using distance image volumes and sparse coding
    Liu, Y. (yang97_net@163.com), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (08):
  • [7] Local Spatiotemporal Coding and Sparse Representation based Human Action Recognition
    Wang, Bin
    Liu, Yu
    Wang, Wei
    Xu, Wei
    Zhang, Maojun
    FRONTIERS OF MANUFACTURING SCIENCE AND MEASURING TECHNOLOGY III, PTS 1-3, 2013, 401 : 1555 - 1560
  • [8] Joint spatial-temporal attention for action recognition
    Yu, Tingzhao
    Guo, Chaoxu
    Wang, Lingfeng
    Gu, Huxiang
    Xiang, Shiming
    Pan, Chunhong
    PATTERN RECOGNITION LETTERS, 2018, 112 : 226 - 233
  • [9] Spatial-Temporal Neural Networks for Action Recognition
    Jing, Chao
    Wei, Ping
    Sun, Hongbin
    Zheng, Nanning
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2018, 2018, 519 : 619 - 627
  • [10] Spatial-temporal pooling for action recognition in videos
    Wang, Jiaming
    Shao, Zhenfeng
    Huang, Xiao
    Lu, Tao
    Zhang, Ruiqian
    Lv, Xianwei
    NEUROCOMPUTING, 2021, 451 : 265 - 278