Multi-level channel attention excitation network for human action recognition in videos

被引:4
|
作者
Wu, Hanbo [1 ]
Ma, Xin [1 ]
Li, Yibin [1 ]
机构
[1] Shandong Univ, Ctr Robot, Sch Control Sci & Engn, Jinan, Peoples R China
基金
中国国家自然科学基金;
关键词
Human action recognition; 2D CNNs; Channel attention; Spatiotemporal modeling;
D O I
10.1016/j.image.2023.116940
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Channel attention mechanism has continuously attracted strong interests and shown great potential in enhancing the performance of deep CNNs. However, when applied to video-based human action recognition task, most existing methods generally learn channel attention at frame level, which ignores the temporal dependencies and may limit the recognition performance. In this paper, we propose a novel multi-level channel attention excitation (MCAE) module to model the temporal-related channel attention at both frame and video levels. Specifically, based on video convolutional feature maps, frame-level channel attention (FCA) is generated by exploring time-channel correlations, and video-level channel attention (VCA) is generated by aggregating global motion variations. MCAE firstly recalibrates video feature responses with frame-wise FCA, and then activates the motion-sensitive channel features with motion-aware VCA. MCAE module learns the channel discriminability from multiple levels and can act as a guidance to facilitate efficient spatiotemporal feature modeling in activated motion-sensitive channels. It can be flexibly embedded into 2D networks with very limited extra computation cost to construct MCAE-Net, which effectively enhances the spatiotemporal representation of 2D models for video action recognition task Extensive experiments on five human action datasets show that our method achieves superior or very competitive performance compared with the state -of-the-arts, which demonstrates the effectiveness of the proposed method for improving the performance of human action recognition.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Wavelet Multi-Level Attention Capsule Network for Texture Classification
    Tao, Zhiyong
    Wei, Tong
    Li, Jie
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1215 - 1219
  • [42] Multi-Level Temporal Dilated Dense Prediction for Action Recognition
    Wang, Jinpeng
    Lin, Yiqi
    Zhang, Manlin
    Gao, Yuan
    Ma, Andy J.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2553 - 2566
  • [43] Multi-level Feature Attention Network for medical image segmentation
    Zhang, Yaning
    Yin, Jianjian
    Gu, Yanhui
    Chen, Yi
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 263
  • [44] Multi-level attention network: application to brain tumor classification
    Nagur Shareef Shaik
    Teja Krishna Cherukuri
    Signal, Image and Video Processing, 2022, 16 : 817 - 824
  • [45] A Multi-Level Network for Human Pose Estimation
    Shao, Zhanpeng
    Liu, Peng
    Li, Youfu
    Yang, Jianyu
    Zhou, Xiaolong
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13085 - 13091
  • [46] Image inpainting network based on multi-level attention mechanism
    Xiang, Hongyue
    Min, Weidong
    Wei, Zitai
    Zhu, Meng
    Liu, Mengxue
    Deng, Ziyang
    IET IMAGE PROCESSING, 2024, 18 (02) : 428 - 438
  • [47] Multi-Level Attention Map Network for Multimodal Sentiment Analysis
    Xue, Xiaojun
    Zhang, Chunxia
    Niu, Zhendong
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (05) : 5105 - 5118
  • [48] Multi-level attention network: application to brain tumor classification
    Shaik, Nagur Shareef
    Cherukuri, Teja Krishna
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (03) : 817 - 824
  • [49] Multi-level attention fusion network assisted by relative entropy alignment for multimodal speech emotion recognition
    Lei, Jianjun
    Wang, Jing
    Wang, Ying
    APPLIED INTELLIGENCE, 2024, 54 (17-18) : 8478 - 8490
  • [50] Enhancing Food Image Recognition by Multi-Level Fusion and the Attention Mechanism
    Chen, Zengzheng
    Wang, Jianxin
    Wang, Yeru
    FOODS, 2025, 14 (03)