Extracting hierarchical spatial and temporal features for human action recognition

Cited by: 10

Authors:

Zhang, Keting [1]

Zhang, Liqing [1]

Affiliations:

[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Key Lab Shanghai Educ Commiss Intelligent Interac, Shanghai, Peoples R China

Source:

MULTIMEDIA TOOLS AND APPLICATIONS | 2018 / Vol. 77 / Issue 13

Funding:

National Natural Science Foundation of China;

Keywords:

Hierarchical feature extraction; Dual-channel model; Subspace network; Spatial and temporal representation; Action recognition; PARALLEL FRAMEWORK; HEVC;

DOI:

10.1007/s11042-017-5179-7

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Human action recognition is a challenging computer vision task, and many efforts have been made to improve its performance. Most previous work has concentrated on hand-crafted features or on spatial-temporal features learned from multiple contiguous frames. In this paper, we present a dual-channel model that decouples spatial and temporal feature extraction. More specifically, we propose to capture the complementary static form information from a single frame and dynamic motion information from multi-frame differences in two separate channels. In both channels, we use two stacked classical subspace networks to learn hierarchical representations, which are subsequently fused for action recognition. Our model is trained and evaluated on three typical benchmarks: the KTH, UCF, and Hollywood2 datasets. The experimental results show that our approach achieves performance comparable to state-of-the-art methods. In addition, feature analysis and control experiments are carried out to demonstrate the effectiveness of the proposed approach for feature extraction and, thereby, action recognition.
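
The dual-channel idea summarized in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which subspace method is used, so plain PCA stands in here as an assumed "classical subspace network", the squaring nonlinearity between layers is an assumption, differences of consecutive frames are an assumed simplification of "multi-frame differences", and fusion is assumed to be simple concatenation. All function names and parameters (`fit_pca`, `two_stage_features`, `dual_channel_features`, `k1`, `k2`) are hypothetical.

```python
import numpy as np


def fit_pca(X, k):
    """Return (mean, components) for the top-k principal directions of the rows of X."""
    mean = X.mean(axis=0)
    # Rows of vt are the principal directions of the centered data.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:k]


def project(X, model):
    """Project the rows of X onto a fitted PCA subspace."""
    mean, components = model
    return (X - mean) @ components.T


def two_stage_features(X, k1, k2):
    """Two stacked PCA layers (assumed stand-in for the subspace networks)."""
    layer1 = project(X, fit_pca(X, k1)) ** 2       # assumed squaring nonlinearity
    layer2 = project(layer1, fit_pca(layer1, k2))
    # Keep both layers so the representation is hierarchical.
    return np.hstack([layer1, layer2])


def dual_channel_features(video, k1=8, k2=4):
    """video: array of shape (T, H, W). Returns fused features, one row per frame."""
    frames = video.reshape(len(video), -1).astype(float)
    spatial_input = frames[1:]                     # static form: single frames
    temporal_input = np.diff(frames, axis=0)       # motion: consecutive-frame differences
    spatial = two_stage_features(spatial_input, k1, k2)
    temporal = two_stage_features(temporal_input, k1, k2)
    # Assumed fusion: concatenate the two channels.
    return np.hstack([spatial, temporal])
```

Under these assumptions, calling `dual_channel_features` on a video array of shape (20, 6, 6) yields a feature matrix of shape (19, 24): 19 frames, each with 12 spatial and 12 temporal features, which could then be passed to any classifier.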

Citation

Pages: 16053-16068

Page count: 16

50 records in total

[1] Extracting hierarchical spatial and temporal features for human action recognition
Keting Zhang
Liqing Zhang
Multimedia Tools and Applications, 2018, 77 : 16053 - 16068
[2] CASCADED TEMPORAL SPATIAL FEATURES FOR VIDEO ACTION RECOGNITION
Yu, Tingzhao
Gu, Huxiang
Wang, Lingfeng
Xiang, Shiming
Pan, Chunhong
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1552 - 1556
[3] MSAHTA: Mixed Spatial Attention and Hierarchical Temporal Aggregation for Action Recognition
Feng, Jinyuan
Yang, Dan
Ge, Yongxin
Qin, Xiaolei
Chen, Yida
Wang, Yuangan
2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 775 - 782
[4] HUMAN ACTION RECOGNITION VIA SPATIAL AND TEMPORAL METHODS
Eroglu, Hulusi
Gokce, C. Onur
Ilk, H. Gokhan
2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 104 - 107
[5] Combining Handcrafted Spatio-Temporal and Deep Spatial Features for Effective Human Action Recognition
R. Divya Rani
C. J. Prabhakar
Human-Centric Intelligent Systems, 2025, 5 (1): : 123 - 150
[6] Spatio-temporal Semantic Features for Human Action Recognition
Liu, Jia
Wang, Xiaonian
Li, Tianyu
Yang, Jie
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2012, 6 (10): : 2632 - 2649
[7] Human Action Recognition Using Spatial and Temporal Sequences Alignment
Li, Yandi
Zhao, Zhihao
SECOND INTERNATIONAL CONFERENCE ON OPTICS AND IMAGE PROCESSING (ICOIP 2022), 2022, 12328
[8] Human Action Recognition by Decision-Making Level Fusion Based on Spatial-Temporal Features
Li Yandi
Xu Xiping
ACTA OPTICA SINICA, 2018, 38 (08)
[9] Hierarchical and Spatio-Temporal Sparse Representation for Human Action Recognition
Tian, Yi
Kong, Yu
Ruan, Qiuqi
An, Gaoyun
Fu, Yun
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (04) : 1748 - 1762
[10] ST-HViT: spatial-temporal hierarchical vision transformer for action recognition
Xia, Limin
Fu, Weiye
PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (01)
