Action Recognition Using Multiple Pooling Strategies of CNN Features

Cited by: 0
Authors
Haifeng Hu
Zhongke Liao
Xiang Xiao
Affiliation
[1] Sun Yat-sen University, School of Electronic and Information Engineering
Source
Neural Processing Letters | 2019 / Vol. 50
Keywords
Action recognition; Convolutional neural networks; Multiple pooling strategies
DOI
Not available
Chinese Library Classification (CLC) number
Subject classification code
Abstract
Deep convolutional neural networks have shown great potential in human action recognition. To obtain a compact and discriminative feature representation, this paper proposes multiple pooling strategies using CNN features. We explore three pooling strategies: space-time feature pooling (STFP), time filter pooling (TFP), and spatio-temporal pyramid pooling (STPP). STFP shares the advantages of both hand-crafted features and deep ConvNet features. TFP reflects the change of elements on each CNN feature map over time. STPP focuses on the spatial and temporal pyramid structure of the feature maps. We aggregate these pooled features to produce a new discriminative video descriptor. Experimental results show that the three strategies have complementary advantages on the challenging YouTube, UCF50 and UCF101 datasets, and that our video representation is comparable to previous state-of-the-art algorithms.
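The abstract does not give the formulation of any of the three strategies, so the following is only a generic illustration of pyramid-style pooling over a stack of per-frame CNN feature maps, not the paper's STPP. The function name `pyramid_pool`, the `(T, C, H, W)` layout, the choice of max pooling, and the pyramid levels are all assumptions made for this sketch.

```python
import numpy as np


def pyramid_pool(features, time_levels=(1, 2), space_levels=(1, 2)):
    """Max-pool a (T, C, H, W) feature stack over a spatio-temporal pyramid.

    Hypothetical helper: for each temporal level lt and spatial level ls,
    the volume is split into lt x ls x ls cells, and each cell is reduced
    to one C-dimensional vector by taking the per-channel maximum.
    """
    T, C, H, W = features.shape
    if T < max(time_levels) or min(H, W) < max(space_levels):
        raise ValueError("feature stack too small for the requested levels")

    pooled = []
    for lt in time_levels:
        # Integer cell boundaries along the time axis.
        t_edges = np.linspace(0, T, lt + 1).astype(int)
        for ls in space_levels:
            # Integer cell boundaries along height and width.
            h_edges = np.linspace(0, H, ls + 1).astype(int)
            w_edges = np.linspace(0, W, ls + 1).astype(int)
            for i in range(lt):
                for j in range(ls):
                    for k in range(ls):
                        cell = features[
                            t_edges[i]:t_edges[i + 1],
                            :,
                            h_edges[j]:h_edges[j + 1],
                            w_edges[k]:w_edges[k + 1],
                        ]
                        # Per-channel max over time, height, and width.
                        pooled.append(cell.max(axis=(0, 2, 3)))
    # Concatenate all cell vectors into one fixed-length descriptor.
    return np.concatenate(pooled)


# Assumed toy input: 8 frames, 4 channels, 7x7 feature maps.
features = np.random.default_rng(0).random((8, 4, 7, 7))
descriptor = pyramid_pool(features)
```

With the default levels the pyramid has 1 + 4 + 2 + 8 = 15 cells, so the descriptor length is 15 times the channel count regardless of how many frames the video has, which is the property that makes this kind of pooling useful for variable-length clips.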
Pages: 379-396
Number of pages: 17
Related papers
50 items in total
  • [1] Action Recognition Using Multiple Pooling Strategies of CNN Features
    Hu, Haifeng
    Liao, Zhongke
    Xiao, Xiang
    NEURAL PROCESSING LETTERS, 2019, 50 (01) : 379 - 396
  • [2] EFFICIENT POOLING OF IMAGE BASED CNN FEATURES FOR ACTION RECOGNITION IN VIDEOS
    Banerjee, Biplab
    Murino, Vittorio
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2637 - 2641
  • [3] Binary Hashing CNN Features for Action Recognition
    Li, Weisheng
    Feng, Chen
    Xiao, Bin
    Chen, Yanquan
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (09) : 4412 - 4428
  • [4] Human Action Recognition Based on SVM Using Multiple Features
    Huang, Xianping
    Zheng, Lili
    Liang, Ronhua
    Wang, Wanliang
    Ma, Xiangyin
    2012 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2012), 2012, 12 : 160 - 165
  • [5] Human action recognition using attention based LSTM network with dilated CNN features
    Muhammad, Khan
    Mustaqeem
    Ullah, Amin
    Imran, Ali Shariq
    Sajjad, Muhammad
    Kiran, Mustafa Servet
    Sannino, Giovanna
    de Albuquerque, Victor Hugo C.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 125 : 820 - 830
  • [6] Rank Pooling for Action Recognition
    Fernando, Basura
    Gavves, Efstratios
    Oramas, Jose M.
    Ghodrati, Amir
    Tuytelaars, Tinne
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) : 773 - 787
  • [7] Convolution neural network with multiple pooling strategies for speech emotion recognition
    Jiang, Pengxu
    Zou, Cairong
    2022 6TH INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, ISCSIC, 2022, : 89 - 92
  • [8] Action Recognition in Video Sequences using Deep Bi-Directional LSTM With CNN Features
    Ullah, Amin
    Ahmad, Jamil
    Muhammad, Khan
    Sajjad, Muhammad
    Baik, Sung Wook
    IEEE ACCESS, 2018, 6 : 1155 - 1166
  • [9] Adaptive Pooling of the Most Relevant Spatio-Temporal Features for Action Recognition
    Ahmed, Faisal
    Paul, Padma Polash
    Gavrilova, Marina
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 177 - 180
  • [10] Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning
    Ali, Saad
    Shah, Mubarak
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (02) : 288 - 303