FEXNet: Foreground Extraction Network for Human Action Recognition

Cited by: 29
Authors
Shen, Zhongwei [1]
Wu, Xiao-Jun [1]
Xu, Tianyang [1, 2]
Affiliations
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
Funding
National Natural Science Foundation of China
Keywords
Convolutional neural networks; Spatiotemporal phenomena; Feature extraction; Three-dimensional displays; Solid modeling; Iron; Image recognition; Foreground-related features; spatiotemporal modeling; action recognition;
DOI
10.1109/TCSVT.2021.3103677
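Chinese Library Classification
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification code
0808; 0809
Abstract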
Because most human actions in video sequences involve continuous interactions between foreground subjects rather than the background scene, disentangling the foreground from the background is important for advanced action recognition systems. In this paper, we therefore propose a Foreground EXtraction (FEX) block that explicitly models foreground clues to handle action subjects effectively. The FEX block contains two components. The first is a Foreground Enhancement (FE) module, which highlights the feature channels potentially related to action attributes, providing channel-level refinement for the subsequent spatiotemporal modeling. The second is a Scene Segregation (SS) module, which splits feature maps into foreground and background. Specifically, a temporal model with dynamic enhancement is constructed for the foreground part, reflecting the essential nature of the action category, while the background is modeled using simple spatial convolutions that map the inputs to a consistent feature space. FEX blocks can be inserted into existing 2D CNNs (the resulting network is denoted FEXNet) for spatiotemporal modeling, concentrating on foreground clues for effective action inference. Experiments on Something-Something V1, V2, and Kinetics400 verify the effectiveness of the proposed method.
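The two-stage structure described in the abstract can be illustrated with a rough NumPy sketch. It is not the authors' implementation: the abstract does not say how the FE gate is computed or how the SS module separates foreground from background, so the sigmoid channel gate, the fixed channel split (`fg_ratio`), the smoothing kernel, the frame-difference term, and the 3x3 mean filter are all assumptions made for illustration, as are the function names.

```python
import numpy as np

def foreground_enhancement(x, w):
    """Channel gate (assumed form). x: (T, C, H, W); w: hypothetical (C, C) weights."""
    pooled = x.mean(axis=(0, 2, 3))              # global average per channel, (C,)
    gate = 1.0 / (1.0 + np.exp(-(w @ pooled)))   # sigmoid gate in (0, 1), (C,)
    return x * gate[None, :, None, None]         # rescale each channel

def temporal_branch(fg, kernel=(0.25, 0.5, 0.25)):
    """Foreground path (assumed): temporal smoothing plus a frame-difference term."""
    padded = np.pad(fg, ((1, 1), (0, 0), (0, 0), (0, 0)))  # zero-pad along T
    smoothed = sum(k * padded[i:i + fg.shape[0]] for i, k in enumerate(kernel))
    diff = np.zeros_like(fg)
    diff[1:] = fg[1:] - fg[:-1]                  # motion cue between adjacent frames
    return smoothed + diff

def spatial_branch(bg):
    """Background path (assumed): 3x3 mean filter applied to each frame."""
    _, _, H, W = bg.shape
    padded = np.pad(bg, ((0, 0), (0, 0), (1, 1), (1, 1)))  # zero-pad H and W
    out = np.zeros_like(bg)
    for dy in range(3):
        for dx in range(3):
            out += padded[:, :, dy:dy + H, dx:dx + W]
    return out / 9.0

def fex_block(x, w, fg_ratio=0.5):
    """FE gate, then a channel split into foreground/background (split rule assumed)."""
    x = foreground_enhancement(x, w)
    n_fg = int(x.shape[1] * fg_ratio)
    fg, bg = x[:, :n_fg], x[:, n_fg:]
    return np.concatenate([temporal_branch(fg), spatial_branch(bg)], axis=1)

rng = np.random.default_rng(0)
x = rng.standard_normal((8, 16, 14, 14))         # 8 frames, 16 channels, 14x14 maps
w = rng.standard_normal((16, 16)) * 0.1
y = fex_block(x, w)
print(y.shape)  # (8, 16, 14, 14)
```

The output keeps the input shape, which is what would let such a block be dropped between the layers of an existing 2D CNN; in a trained network the fixed filters here would be replaced by learned convolutions.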
Pages: 3141-3151
Page count: 11