A Novel Action Recognition Scheme Based on Spatial-Temporal Pyramid Model

Cited by: 0
Authors
Zhao, Hengying [1]
Xiang, Xinguang [1]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
Keywords
Action recognition; Spatial-temporal; Multi-scale; Visual dictionary; DENSE;
DOI
10.1007/978-3-319-77383-4_21
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
Recognizing actions is one of the most important challenges in computer vision. In this paper, we propose a novel action recognition scheme based on a spatial-temporal pyramid model. First, we extract basic visual feature descriptors from each video. Second, we construct a visual dictionary over the whole set of visual features. Third, we construct a novel spatial-temporal pyramid model by dividing the visual feature set of each video into multi-scale blocks in the 2-dimensional spatial domain and the 1-dimensional temporal domain separately. We then compute a distribution histogram representation for each block at each scale using the bag-of-features model and our new visual dictionary. Finally, we normalize the final video descriptors and recognize the actions using an SVM. Experimental results show that our scheme achieves higher action recognition accuracy than several state-of-the-art methods.
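The pyramid step described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the pyramid levels (1 and 2 divisions per axis), the nearest-word assignment, and the L2 normalization are all hypothetical choices, since the abstract does not specify them. Dictionary learning (for example by k-means) and the SVM classifier are left out.

```python
import numpy as np

def assign_words(descriptors, dictionary):
    """Index of the nearest visual word (Euclidean) for each descriptor."""
    d2 = ((descriptors[:, None, :] - dictionary[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def block_histograms(words, cell_ids, n_cells, n_words):
    """Bag-of-features word counts per block, flattened to one vector."""
    hist = np.zeros((n_cells, n_words))
    np.add.at(hist, (cell_ids, words), 1.0)  # unbuffered add handles repeated indices
    return hist.ravel()

def pyramid_descriptor(words, xyt, n_words,
                       spatial_levels=(1, 2), temporal_levels=(1, 2)):
    """Concatenate block histograms from a 2-D spatial pyramid and a
    separate 1-D temporal pyramid.

    xyt holds (x, y, t) feature positions scaled to [0, 1).
    The levels and the L2 normalization are assumptions for illustration.
    """
    x, y, t = xyt[:, 0], xyt[:, 1], xyt[:, 2]
    parts = []
    for g in spatial_levels:            # g x g grid over the frame
        cx = np.minimum((x * g).astype(int), g - 1)
        cy = np.minimum((y * g).astype(int), g - 1)
        parts.append(block_histograms(words, cy * g + cx, g * g, n_words))
    for g in temporal_levels:           # g segments along the time axis
        ct = np.minimum((t * g).astype(int), g - 1)
        parts.append(block_histograms(words, ct, g, n_words))
    desc = np.concatenate(parts)
    norm = np.linalg.norm(desc)
    return desc / norm if norm > 0 else desc
```

With these assumed levels, a dictionary of K words gives a descriptor of length (1 + 4 + 1 + 2) * K per video, which could then be passed to any SVM implementation.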
Pages: 212-221
Number of pages: 10