FEXNet: Foreground Extraction Network for Human Action Recognition

被引:29
|
作者
Shen, Zhongwei [1 ]
Wu, Xiao-Jun [1 ]
Xu, Tianyang [1 ,2 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
基金
中国国家自然科学基金;
关键词
Convolutional neural networks; Spatiotemporal phenomena; Feature extraction; Three-dimensional displays; Solid modeling; Iron; Image recognition; Foreground-related features; spatiotemporal modeling; action recognition;
D O I
10.1109/TCSVT.2021.3103677
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As most human actions in video sequences embody the continuous interactions between foregrounds rather than the background scene, it is significant to disentangle these foregrounds from the background for advanced action recognition systems. In this paper, therefore, we propose a Foreground EXtraction (FEX) block to explicitly model the foreground clues to achieve effective management of action subjects. In particular, the designed FEX block contains two components. The first part is a Foreground Enhancement (FE) module, which highlights the potential feature channels related to the action attributes, providing channel-level refinement for the following spatiotemporal modeling. The second phase is a Scene Segregation (SS) module, which splits feature maps into foreground and background. Specifically, a temporal model with dynamic enhancement is constructed for the foreground part, reflecting the essential nature of the action category. While the background is modeled using simple spatial convolutions, mapping the inputs to the consistent feature space. The FEX blocks can be inserted into existing 2D CNNs (denoted as FEXNet) for spatiotemporal modeling, concentrating on the foreground clues for effective action inference. Our experiments performed on Something-Something V1, V2 and Kinetics400 verify the effectiveness of the proposed method.
引用
收藏
页码:3141 / 3151
页数:11
相关论文
共 50 条
  • [41] Human Tumble Action Recognition Using Spiking Neuron Network
    Li, Yu
    Wang, Ke
    Huang, MinFeng
    Li, RuiFeng
    Gao, TianZe
    Wu, Jun
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 5309 - 5313
  • [42] Action Recognition Using High Temporal Resolution 3D Neural Network Based on Dilated Convolution
    Xu, Yongyang
    Feng, Yaxing
    Xie, Zhong
    Xie, Mingyu
    Luo, Wei
    IEEE ACCESS, 2020, 8 : 165365 - 165372
  • [43] Batch Entropy Supervised Convolutional Neural Networks for Feature Extraction and Harmonizing for Action Recognition
    Hossain, Md Imtiaz
    Siddique, Ashraf
    Hossain, Md Alamgir
    Hossain, Md Delowar
    Huh, Eui-Nam
    IEEE ACCESS, 2020, 8 : 206427 - 206444
  • [44] Scale-Aware Graph Convolutional Network With Part-Level Refinement for Skeleton-Based Human Action Recognition
    Li, Chang
    Mao, Yingchi
    Huang, Qian
    Zhu, Xiaowei
    Wu, Jie
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4311 - 4324
  • [45] Action Recognition Based on a Hybrid Deep Network
    Zou Y.
    Zhou X.
    Ren X.
    SN Computer Science, 2021, 2 (6)
  • [46] Human Skeleton Feature Optimizer and Adaptive Structure Enhancement Graph Convolution Network for Action Recognition
    Xiong, Xin
    Min, Weidong
    Wang, Qi
    Zha, Cheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (01) : 342 - 353
  • [47] Decoupled Knowledge Embedded Graph Convolutional Network for Skeleton-Based Human Action Recognition
    Liu, Yanan
    Li, Yanqiu
    Zhang, Hao
    Zhang, Xuejie
    Xu, Dan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9445 - 9457
  • [48] Sensor-Based Gymnastics Action Recognition Using Time-Series Images and a Lightweight Feature Fusion Network
    Wang, Wanyue
    Lian, Chao
    Zhao, Yuliang
    Zhan, Zhikun
    IEEE SENSORS JOURNAL, 2024, 24 (24) : 42573 - 42583
  • [49] STA-CNN: Convolutional Spatial-Temporal Attention Learning for Action Recognition
    Yang, Hao
    Yuan, Chunfeng
    Zhang, Li
    Sun, Yunda
    Hu, Weiming
    Maybank, Stephen J.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 5783 - 5793
  • [50] Hierarchical Soft Quantization for Skeleton-Based Human Action Recognition
    Yang, Jianyu
    Liu, Wu
    Yuan, Junsong
    Mei, Tao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 883 - 898