Convolutional Neural Networks with Generalized Attentional Pooling for Action Recognition

Cited by: 0
Authors
Wang, Yunfeng [1]
Zhou, Wengang [1]
Zhang, Qilin [2]
Li, Houqiang [1]
Affiliations
[1] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei, Anhui, Peoples R China
[2] HERE Technol, Highly Automated Driving, Chicago, IL USA
Source
2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP) | 2018
Keywords
Action Recognition; Generalized Attentional Pooling; Convolutional Neural Network
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Inspired by recent advances in attentional pooling for image classification and action recognition, we propose a Generalized Attentional Pooling (GAP) based Convolutional Neural Network (CNN) algorithm for action recognition in still images. The proposed GAP-CNN can be formulated as a new approximation of the second-order (bilinear) pooling techniques widely used in fine-grained image classification. Unlike the existing rank-1 approximation, it introduces a generalized factoring (with nonlinear functions) to exploit the intrinsic structural information of the sample covariance matrices of convolutional layer outputs. Without requiring preprocessing steps such as detecting object (e.g., human body) bounding boxes, the proposed GAP-CNN automatically focuses on the most informative parts of still images. With additional guidance from human pose keypoints, the proposed GAP-CNN algorithm achieves state-of-the-art action recognition accuracy on the large-scale MPII still image dataset.
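The rank-1 approximation that the abstract contrasts against can be illustrated with a small numerical sketch. It assumes the usual formulation in which a second-order pooling score is trace(X^T X W^T) for a feature map X with one row per spatial location, and W is factored as the outer product a b^T. All names here (X, W, a, b, bilinear_score, rank1_score) are illustrative assumptions, and the nonlinear generalized factoring of GAP-CNN is not specified in the abstract, so it is not reproduced.

```python
import numpy as np


def bilinear_score(X, W):
    """Second-order (bilinear) pooling score: trace(X^T X W^T).

    X: (n_locations, n_channels) convolutional features (assumed layout).
    W: (n_channels, n_channels) classifier weights (assumed layout).
    """
    return np.trace(X.T @ X @ W.T)


def rank1_score(X, a, b):
    """Same score when W is restricted to the rank-1 form W = a b^T.

    X @ a and X @ b are two per-location maps; their inner product
    equals the full bilinear score for W = outer(a, b).
    """
    return (X @ a) @ (X @ b)


# Toy check on random data (not data from the paper).
rng = np.random.default_rng(0)
X = rng.standard_normal((49, 8))   # e.g. a 7x7 grid with 8 channels
a = rng.standard_normal(8)
b = rng.standard_normal(8)

full = bilinear_score(X, np.outer(a, b))
factored = rank1_score(X, a, b)
print(np.isclose(full, factored))
```

In this factored form the cost drops from a quadratic number of weights per class to two vectors, and one of the two per-location maps can be read as a spatial attention map over the image.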
Pages: 4