Action Recognition Using Multiple Pooling Strategies of CNN Features

被引：0

作者：

Haifeng Hu

Zhongke Liao

Xiang Xiao

机构：

[1] Sun Yat-sen Univercity,School of Electronic and Information Engineering

来源：

Neural Processing Letters | 2019年 / 50卷

关键词：

Action recognition; Convolutional neural networks; Multiple pooling strategies;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

The deep convolution neural network has shown great potential in the field of human action recognition. For the sake of obtaining compact and discriminative feature representation, this paper proposes multiple pooling strategies using CNN features. We explore three different pooling strategies, which are called space-time feature pooling (STFP), time filter pooling (TFP) and spatio-temporal pyramid pooling (STPP), respectively. STFP shares the advantages of both hand-crafted features and deep ConvNets features. TFP reflects the change of elements on each CNN feature map over time. STPP focuses on the spatial and temporal pyramid structure of the feature maps. We aggregate these pooled features to produce a new discriminative video descriptor. Experimental results show that the three strategies have complementary advantages on the challenging YouTube, UCF50 and UCF101 datasets, and our video representation is comparable to the previous state-of-the-art algorithms.

引用

页码：379 / 396

页数：17

共 50 条

[1] Action Recognition Using Multiple Pooling Strategies of CNN Features
Hu, Haifeng
Liao, Zhongke
Xiao, Xiang
NEURAL PROCESSING LETTERS, 2019, 50 (01) : 379 - 396
[2] EFFICIENT POOLING OF IMAGE BASED CNN FEATURES FOR ACTION RECOGNITION IN VIDEOS
Banerjee, Biplab
Murino, Vittorio
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2637 - 2641
[3] Binary Hashing CNN Features for Action Recognition
Li, Weisheng
Feng, Chen
Xiao, Bin
Chen, Yanquan
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (09): : 4412 - 4428
[4] Human Action Recognition Based on SVM Using Multiple Features
Huang, Xianping
Zheng, Lili
Liang, Ronhua
Wang, Wanliang
Ma, Xiangyin
2012 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2012), 2012, 12 : 160 - 165
[5] Human action recognition using attention based LSTM network with dilated CNN features
Muhammad, Khan
Mustaqeem
Ullah, Amin
Imran, Ali Shariq
Sajjad, Muhammad
Kiran, Mustafa Servet
Sannino, Giovanna
de Albuquerque, Victor Hugo C.
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 125 : 820 - 830
[6] Rank Pooling for Action Recognition
Fernando, Basura
Gavves, Efstratios
Oramas, Jose M.
Ghodrati, Amir
Tuytelaars, Tinne
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) : 773 - 787
[7] Convolution neural network with multiple pooling strategies for speech emotion recognition
Jiang, Pengxu
Zou, Cairong
2022 6TH INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, ISCSIC, 2022, : 89 - 92
[8] Action Recognition in Video Sequences using Deep Bi-Directional LSTM With CNN Features
Ullah, Amin
Ahmad, Jamil
Muhammad, Khan
Sajjad, Muhammad
Baik, Sung Wook
IEEE ACCESS, 2018, 6 : 1155 - 1166
[9] Adaptive Pooling of the Most Relevant Spatio-Temporal Features for Action Recognition
Ahmed, Faisal
Paul, Padma Polash
Gavrilova, Marina
PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 177 - 180
[10] Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning
Ali, Saad
Shah, Mubarak
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (02) : 288 - 303

← 1 2 3 4 5 →