Action Recognition Using Multiple Pooling Strategies of CNN Features

Cited by: 0

Authors:

Haifeng Hu

Zhongke Liao

Xiang Xiao

Affiliation:

[1] Sun Yat-sen University, School of Electronic and Information Engineering

Source:

Neural Processing Letters | 2019 / Vol. 50

Keywords:

Action recognition; Convolutional neural networks; Multiple pooling strategies

DOI:

Not available

Abstract:

Deep convolutional neural networks have shown great potential in human action recognition. To obtain a compact and discriminative feature representation, this paper proposes multiple pooling strategies for CNN features. We explore three pooling strategies: space-time feature pooling (STFP), time filter pooling (TFP), and spatio-temporal pyramid pooling (STPP). STFP shares the advantages of both hand-crafted features and deep ConvNet features. TFP reflects how the elements of each CNN feature map change over time. STPP focuses on the spatial and temporal pyramid structure of the feature maps. We aggregate these pooled features to produce a new discriminative video descriptor. Experimental results show that the three strategies have complementary advantages on the challenging YouTube, UCF50 and UCF101 datasets, and that our video representation is comparable to previous state-of-the-art algorithms.
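
As a rough illustration of the pyramid idea behind STPP, and not the authors' implementation (the abstract gives no formulas), the sketch below max-pools a stack of CNN feature maps over a space-time grid at several resolutions and concatenates the results. The function name `pyramid_pool`, the `(T, H, W, C)` tensor layout, the choice of max pooling, and the `levels` parameter are all assumptions made for this example.

```python
import numpy as np

def pyramid_pool(features, levels=(1, 2)):
    """Max-pool a (T, H, W, C) feature volume over a space-time pyramid.

    Hypothetical helper: at level n the volume is cut into n x n x n
    cells (time x height x width) and each cell is max-pooled per channel.
    Assumes T, H and W are all >= max(levels) so no cell is empty.
    """
    T, H, W, C = features.shape
    pooled = []
    for n in levels:
        # array_split tolerates sizes that are not divisible by n
        t_bins = np.array_split(np.arange(T), n)
        h_bins = np.array_split(np.arange(H), n)
        w_bins = np.array_split(np.arange(W), n)
        for t in t_bins:
            for h in h_bins:
                for w in w_bins:
                    cell = features[np.ix_(t, h, w)]  # shape (len(t), len(h), len(w), C)
                    pooled.append(cell.max(axis=(0, 1, 2)))
    # One C-dimensional vector per cell, concatenated into a single descriptor
    return np.concatenate(pooled)

# Toy input: 8 frames of 7x7 feature maps with 4 channels (random stand-in data)
video = np.random.default_rng(0).random((8, 7, 7, 4))
descriptor = pyramid_pool(video)
```

With `levels=(1, 2)` the descriptor has `C * (1 + 8)` entries, so its length depends only on the channel count and the pyramid levels, not on the number of frames or the spatial size of the feature maps.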

Pages: 379-396

Page count: 17

50 items in total

[41] Handcrafted versus CNN Features for Ear Recognition
Alshazly, Hammam
Linse, Christoph
Barth, Erhardt
Martinetz, Thomas
SYMMETRY-BASEL, 2019, 11 (12)
[42] Hierarchical Gaussian descriptor based on local pooling for action recognition
Nguyen, Xuan Son
Mouaddib, Abdel-Illah
Thanh Phuong Nguyen
MACHINE VISION AND APPLICATIONS, 2019, 30 (02) : 321 - 343
[43] Hierarchical Gaussian descriptor based on local pooling for action recognition
Xuan Son Nguyen
Abdel-Illah Mouaddib
Thanh Phuong Nguyen
Machine Vision and Applications, 2019, 30 : 321 - 343
[44] Convolutional Neural Networks with Generalized Attentional Pooling for Action Recognition
Wang, Yunfeng
Zhou, Wengang
Zhang, Qilin
Li, Houqiang
2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018
[45] Robust Human Action Recognition Using Dynamic Movement Features
Zhang, Huiwen
Fu, Mingliang
Luo, Haitao
Zhou, Weijia
INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2017, PT I, 2017, 10462 : 474 - 484
[46] Human Action Recognition in Videos Using Hybrid Motion Features
Liu, Si
Liu, Jing
Zhang, Tianzhu
Lu, Hanqing
ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 411 - 421
[47] Bidirectional LSTM with saliency-aware 3D-CNN features for human action recognition
Arif, Sheeraz
Wang, Jing
Siddiqui, Adnan
Hussain, Rashid
Hussain, Fida
JOURNAL OF ENGINEERING RESEARCH, 2021, 9 (3A) : 115 - 133
[48] Action Recognition Using Multilevel Features and Latent Structural SVM
Wu, Xinxiao
Xu, Dong
Duan, Lixin
Luo, Jiebo
Jia, Yunde
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (08) : 1422 - 1431
[49] Temporal Pyramid Pooling Based Relation Network for Action Recognition
Zheng, Zhenxing
An, Gaoyun
Ruan, Qiuqi
PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 644 - 647
[50] HUMAN ACTION RECOGNITION USING ROBUST POWER SPECTRUM FEATURES
Ragheb, Hossein
Velastin, Sergio
Remagnino, Paolo
Ellis, Tim
2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 753 - 756
