Action Recognition Using Multiple Pooling Strategies of CNN Features

被引:0
|
作者
Haifeng Hu
Zhongke Liao
Xiang Xiao
机构
[1] Sun Yat-sen Univercity,School of Electronic and Information Engineering
来源
Neural Processing Letters | 2019年 / 50卷
关键词
Action recognition; Convolutional neural networks; Multiple pooling strategies;
D O I
暂无
中图分类号
学科分类号
摘要
The deep convolution neural network has shown great potential in the field of human action recognition. For the sake of obtaining compact and discriminative feature representation, this paper proposes multiple pooling strategies using CNN features. We explore three different pooling strategies, which are called space-time feature pooling (STFP), time filter pooling (TFP) and spatio-temporal pyramid pooling (STPP), respectively. STFP shares the advantages of both hand-crafted features and deep ConvNets features. TFP reflects the change of elements on each CNN feature map over time. STPP focuses on the spatial and temporal pyramid structure of the feature maps. We aggregate these pooled features to produce a new discriminative video descriptor. Experimental results show that the three strategies have complementary advantages on the challenging YouTube, UCF50 and UCF101 datasets, and our video representation is comparable to the previous state-of-the-art algorithms.
引用
收藏
页码:379 / 396
页数:17
相关论文
共 50 条
  • [41] Handcrafted versus CNN Features for Ear Recognition
    Alshazly, Hammam
    Linse, Christoph
    Barth, Erhardt
    Martinetz, Thomas
    SYMMETRY-BASEL, 2019, 11 (12):
  • [42] Hierarchical Gaussian descriptor based on local pooling for action recognition
    Nguyen, Xuan Son
    Mouaddib, Abdel-Illah
    Thanh Phuong Nguyen
    MACHINE VISION AND APPLICATIONS, 2019, 30 (02) : 321 - 343
  • [43] Hierarchical Gaussian descriptor based on local pooling for action recognition
    Xuan Son Nguyen
    Abdel-Illah Mouaddib
    Thanh Phuong Nguyen
    Machine Vision and Applications, 2019, 30 : 321 - 343
  • [44] Convolutional Neural Networks with Generalized Attentional Pooling for Action Recognition
    Wang, Yunfeng
    Zhou, Wengang
    Zhang, Qilin
    Li, Houqiang
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [45] Robust Human Action Recognition Using Dynamic Movement Features
    Zhang, Huiwen
    Fu, Mingliang
    Luo, Haitao
    Zhou, Weijia
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2017, PT I, 2017, 10462 : 474 - 484
  • [46] Human Action Recognition in Videos Using Hybrid Motion Features
    Liu, Si
    Liu, Jing
    Zhang, Tianzhu
    Lu, Hanqing
    ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 411 - 421
  • [47] Bidirectional LSTM with saliency-aware 3D-CNN features for human action recognition
    Arif, Sheeraz
    Wang, Jing
    Siddiqui, Adnan
    Hussain, Rashid
    Hussain, Fida
    JOURNAL OF ENGINEERING RESEARCH, 2021, 9 (3A): : 115 - 133
  • [48] Action Recognition Using Multilevel Features and Latent Structural SVM
    Wu, Xinxiao
    Xu, Dong
    Duan, Lixin
    Luo, Jiebo
    Jia, Yunde
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (08) : 1422 - 1431
  • [49] Temporal Pyramid Pooling Based Relation Network for Action Recognition
    Zheng, Zhenxing
    An, Gaoyun
    Ruan, Qiuqi
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 644 - 647
  • [50] HUMAN ACTION RECOGNITION USING ROBUST POWER SPECTRUM FEATURES
    Ragheb, Hossein
    Velastin, Sergio
    Remagnino, Paolo
    Ellis, Tim
    2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 753 - 756