A Novel Action Recognition Scheme Based on Spatial-Temporal Pyramid Model

Cited by: 0
Authors
Zhao, Hengying [1]
Xiang, Xinguang [1]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
Source
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II | 2018 / Vol. 10736
Keywords
Action recognition; Spatial-temporal; Multi-scale; Visual dictionary; DENSE
DOI
10.1007/978-3-319-77383-4_21
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recognizing actions is one of the most important challenges in computer vision. In this paper, we propose a novel action recognition scheme based on a spatial-temporal pyramid model. First, we extract basic visual feature descriptors from each video. Second, we construct a visual dictionary from the whole set of visual features. Third, we construct a novel spatial-temporal pyramid model by dividing the visual feature set of each video into multi-scale blocks, separately in the 2-dimensional spatial domain and the 1-dimensional temporal domain. We then compute a distribution histogram for each block at each scale using the bag-of-features model and our new visual dictionary. Finally, we normalize the final video descriptors and recognize the actions using an SVM. Experimental results show that our scheme achieves higher action recognition accuracy than several state-of-the-art methods.
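The pyramid step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation. The abstract does not specify the grid sizes, the word-assignment rule, or the normalization, so the sketch assumes 2^l x 2^l spatial grids and 2^l temporal segments per level, hard nearest-word assignment, and L2 normalization. All function and parameter names (`assign_words`, `block_histograms`, `st_pyramid_descriptor`, `extent`, `levels`) are hypothetical.

```python
import numpy as np


def assign_words(descriptors, dictionary):
    """Index of the nearest visual word for each descriptor (hard assignment, assumed)."""
    # Squared Euclidean distance between every descriptor and every dictionary word.
    d2 = ((descriptors[:, None, :] - dictionary[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)


def block_histograms(words, cell_ids, n_cells, n_words):
    """One bag-of-features histogram per block, flattened into a single vector."""
    hist = np.zeros((n_cells, n_words))
    # Unbuffered add, so repeated (cell, word) pairs are all counted.
    np.add.at(hist, (cell_ids, words), 1.0)
    return hist.ravel()


def st_pyramid_descriptor(descriptors, positions, dictionary, extent, levels=2):
    """
    descriptors: (N, D) local features of one video.
    positions:   (N, 3) feature locations as (x, y, t).
    dictionary:  (K, D) visual words, e.g. k-means centres (assumed).
    extent:      (width, height, n_frames) of the video.
    """
    n_words = dictionary.shape[0]
    words = assign_words(descriptors, dictionary)

    # Map positions into [0, 1) so a block index is simply floor(rel * grid).
    rel = np.clip(positions / np.asarray(extent, dtype=float), 0.0, 1.0 - 1e-9)

    parts = []
    # Spatial pyramid: level l splits the frame into 2^l x 2^l blocks.
    for level in range(levels + 1):
        g = 2 ** level
        cx = (rel[:, 0] * g).astype(int)
        cy = (rel[:, 1] * g).astype(int)
        parts.append(block_histograms(words, cy * g + cx, g * g, n_words))

    # Temporal pyramid: level l splits the video into 2^l segments.
    # Level 0 is skipped because it equals the whole-video histogram above.
    for level in range(1, levels + 1):
        g = 2 ** level
        ct = (rel[:, 2] * g).astype(int)
        parts.append(block_histograms(words, ct, g, n_words))

    vec = np.concatenate(parts)
    norm = np.linalg.norm(vec)  # L2 normalization is an assumption
    return vec / norm if norm > 0 else vec


rng = np.random.default_rng(0)
demo_desc = rng.normal(size=(200, 8))                   # synthetic descriptors
demo_pos = rng.uniform(size=(200, 3)) * [320, 240, 100]  # synthetic (x, y, t)
demo_dict = rng.normal(size=(5, 8))                     # synthetic dictionary
demo_vec = st_pyramid_descriptor(demo_desc, demo_pos, demo_dict, (320, 240, 100))
```

With `levels=2` and a 5-word dictionary, the descriptor concatenates 1 + 4 + 16 spatial blocks and 2 + 4 temporal blocks, 27 histograms in total. The resulting per-video vectors could then be passed to any SVM implementation for classification, as the abstract describes.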
Pages: 212-221
Page count: 10
Related papers
50 in total
  • [21] Deep Spatial-Temporal Model Based Cross-Scene Action Recognition Using Commodity WiFi
    Sheng, Biyun
    Xiao, Fu
    Sha, Letian
    Sun, Lijuan
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (04) : 3592 - 3601
  • [22] A Spatial-Temporal Feature Fusion Strategy for Skeleton-Based Action Recognition
    Chen, Yitian
    Xu, Yuchen
    Xie, Qianglai
    Xiong, Lei
    Yao, Leiyue
    2023 INTERNATIONAL CONFERENCE ON DATA SECURITY AND PRIVACY PROTECTION, DSPP, 2023, : 207 - 215
  • [23] ST-HViT: spatial-temporal hierarchical vision transformer for action recognition
    Xia, Limin
    Fu, Weiye
    PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (01)
  • [24] An End-to-End Spatial-Temporal Transformer Model for Surgical Action Triplet Recognition
    Zou, Xiaoyang
    Yu, Derong
    Tao, Rong
    Zheng, Guoyan
    12TH ASIAN-PACIFIC CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING, VOL 2, APCMBE 2023, 2024, 104 : 114 - 120
  • [25] Multi-Branch Spatial-Temporal Network for Action Recognition
    Wang, Yingying
    Li, Wei
    Tao, Ran
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (10) : 1556 - 1560
  • [26] Recurrent Spatial-Temporal Attention Network for Action Recognition in Videos
    Du, Wenbin
    Wang, Yali
    Qiao, Yu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) : 1347 - 1360
  • [27] Human action recognition based on multi-mode spatial-temporal feature fusion
    Wang, Dongli
    Yang, Jun
    Zhou, Yan
    2019 22ND INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2019), 2019,
  • [28] STSD: spatial-temporal semantic decomposition transformer for skeleton-based action recognition
    Cui, Hu
    Hayama, Tessai
    MULTIMEDIA SYSTEMS, 2024, 30 (01)
  • [29] Spatial-Temporal Dynamic Graph Attention Network for Skeleton-Based Action Recognition
    Rahevar, Mrugendrasinh
    Ganatra, Amit
    Saba, Tanzila
    Rehman, Amjad
    Bahaj, Saeed Ali
    IEEE ACCESS, 2023, 11 : 21546 - 21553
  • [30] Spatial-Temporal gated graph attention network for skeleton-based action recognition
    Rahevar, Mrugendrasinh
    Ganatra, Amit
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 929 - 939