Extracting hierarchical spatial and temporal features for human action recognition

被引：12

作者：

Zhang, Keting ^{[1
]}

Zhang, Liqing ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Key Lab Shanghai Educ Commiss Intelligent Interac, Shanghai, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2018年 / 77卷 / 13期

基金：

中国国家自然科学基金;

关键词：

Hierarchical feature extraction; Dual-channel model; Subspace network; Spatial and temporal representation; Action recognition; PARALLEL FRAMEWORK; HEVC;

D O I：

10.1007/s11042-017-5179-7

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Human action recognition is a challenging computer vision task and many efforts have been made to improve the performance. Most previous work has concentrated on the hand-crafted features or spatial-temporal features learned from multiple contiguous frames. In this paper, we present a dual-channel model to decouple the spatial and temporal feature extraction. More specifically, we propose to capture the complementary static form information from single frame and dynamic motion information from multi-frame differences in two separate channels. In both channels we use two stacked classical subspace networks to learn hierarchical representations, which are subsequently fused for action recognition. Our model is trained and evaluated on three typical benchmarks: KTH, UCF and Hollywood2 datasets. The experimental results illustrate that our approach achieves comparable performances to the state-of-the-art methods. In addition, both feature analysis and control experiments are also carried out to demonstrate the effectiveness of the proposed approach for feature extraction and thereby action recognition.

引用

页码：16053 / 16068

页数：16

共 50 条

[21] Action Recognition Using Mined Hierarchical Compound Features [J].

Gilbert, Andrew ;

Illingworth, John ;

Bowden, Richard .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) :883-897

[22] Extracting Discriminative Parts with Flexible Number from Low-Rank Features for Human Action Recognition [J].

Shijian Huang ;

Junyong Ye ;

Tongqing Wang ;

Li Jiang ;

Yang Li ;

Xuegang Wu .

Arabian Journal for Science and Engineering, 2016, 41 :2987-3001

[23] Extracting Discriminative Parts with Flexible Number from Low-Rank Features for Human Action Recognition [J].

Huang, Shijian ;

Ye, Junyong ;

Wang, Tongqing ;

Jiang, Li ;

Li, Yang ;

Wu, Xuegang .

ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2016, 41 (08) :2987-3001

[24] Action recognition by extracting pyramidal motion features from skeleton sequences [J].

Lu, Guoliang ;

Zhou, Yiqi ;

Li, Xueyong ;

Lv, Chen .

Lecture Notes in Electrical Engineering, 2015, 339 :251-258

[25] A novel hierarchical framework for human action recognition [J].

Chen, Hongzhao ;

Wang, Guijin ;

Xue, Jing-Hao ;

He, Li .

PATTERN RECOGNITION, 2016, 55 :148-159

[26] Action recognition by learning temporal slowness invariant features [J].

Lishen Pei ;

Mao Ye ;

Xuezhuan Zhao ;

Yumin Dou ;

Jiao Bao .

The Visual Computer, 2016, 32 :1395-1404

[27] Temporal-stochastic tensor features for action recognition [J].

Batalo, Bojan ;

Souza, Lincon S. ;

Gatto, Bernardo B. ;

Sogi, Naoya ;

Fukui, Kazuhiro .

MACHINE LEARNING WITH APPLICATIONS, 2022, 10

[28] Action recognition by learning temporal slowness invariant features [J].

Pei, Lishen ;

Ye, Mao ;

Zhao, Xuezhuan ;

Dou, Yumin ;

Bao, Jiao .

VISUAL COMPUTER, 2016, 32 (11) :1395-1404

[29] Human Action Recognition by Fusion of Convolutional Neural Networks and spatial-temporal Information [J].

Li, Weisheng ;

Ding, Yahui .

8TH INTERNATIONAL CONFERENCE ON INTERNET MULTIMEDIA COMPUTING AND SERVICE (ICIMCS2016), 2016, :255-259

[30] Spatial-Temporal Neural Networks for Action Recognition [J].

Jing, Chao ;

Wei, Ping ;

Sun, Hongbin ;

Zheng, Nanning .

ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2018, 2018, 519 :619-627

← 1 2 3 4 5 →