A Novel Action Recognition Scheme Based on Spatial-Temporal Pyramid Model

Cited by: 0

Authors:

Zhao, Hengying [1]

Xiang, Xinguang [1]

Affiliation:

[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China

Source:

ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II | 2018 / Vol. 10736

Keywords:

Action recognition; Spatial-temporal; Multi-scale; Visual dictionary; DENSE

DOI:

10.1007/978-3-319-77383-4_21

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Recognizing actions is one of the most important challenges in computer vision. In this paper, we propose a novel action recognition scheme based on a spatial-temporal pyramid model. First, we extract basic visual feature descriptors from each video. Second, we construct a visual dictionary over the whole set of visual features. Third, we construct a novel spatial-temporal pyramid model by dividing the visual feature set of each video into multi-scale blocks in the 2-dimensional spatial domain and the 1-dimensional temporal domain separately. We then compute a distribution histogram representation for each block at each scale using the bag-of-features model and our new visual dictionary. Finally, we normalize the final video descriptors and recognize the actions using an SVM. Experimental results show that our scheme achieves higher action recognition accuracy than several state-of-the-art methods.
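
The pyramid step described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names, the choice of 2^l x 2^l spatial blocks and 2^l temporal segments per level, the default levels, and the L2 normalization are all assumptions, since the abstract does not specify them. Each feature is assumed to be already assigned to a visual word (for example by nearest-centroid lookup in a k-means dictionary).

```python
import math


def block_index(value, size, n_bins):
    """Map a coordinate in [0, size] to one of n_bins equal-width bins."""
    return min(int(value / size * n_bins), n_bins - 1)


def st_pyramid_descriptor(features, vocab_size, extent,
                          spatial_levels=(0, 1, 2), temporal_levels=(1, 2)):
    """Build a concatenated bag-of-features descriptor for one video.

    features: list of (x, y, t, word) tuples, where word is a visual-word
              index in [0, vocab_size). Hypothetical input layout.
    extent:   (width, height, duration) of the video.
    The level choices and L2 normalization are assumptions for illustration.
    """
    width, height, duration = extent
    descriptor = []

    # Spatial pyramid: at level l, split the frame into 2^l x 2^l blocks
    # and ignore time, giving one word histogram per block.
    for level in spatial_levels:
        n = 2 ** level
        hist = [0.0] * (n * n * vocab_size)
        for x, y, t, word in features:
            block = block_index(y, height, n) * n + block_index(x, width, n)
            hist[block * vocab_size + word] += 1.0
        descriptor.extend(hist)

    # Temporal pyramid: at level l, split the timeline into 2^l segments
    # and ignore spatial position.
    for level in temporal_levels:
        n = 2 ** level
        hist = [0.0] * (n * vocab_size)
        for x, y, t, word in features:
            hist[block_index(t, duration, n) * vocab_size + word] += 1.0
        descriptor.extend(hist)

    # L2-normalize the concatenated histograms (assumed normalization).
    norm = math.sqrt(sum(v * v for v in descriptor))
    return [v / norm for v in descriptor] if norm > 0 else descriptor


# Toy usage: four features, a 3-word dictionary, a 100x100 video of length 10.
feats = [(10, 10, 0, 0), (90, 10, 5, 1), (10, 90, 9, 2), (90, 90, 9, 0)]
desc = st_pyramid_descriptor(feats, vocab_size=3, extent=(100, 100, 10))
print(len(desc))
```

With these defaults the descriptor has vocab_size x (1 + 4 + 16 + 2 + 4) entries. A vector of this kind could then be passed to any SVM implementation for classification, which is the final step the abstract mentions.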


Pages: 212-221

Page count: 10

