A Spatio-Temporal Motion Network for Action Recognition Based on Spatial Attention

被引:10
|
作者
Yang, Qi [1 ,2 ]
Lu, Tongwei [1 ,2 ]
Zhou, Huabing [1 ,2 ]
机构
[1] Wuhan Inst Technol, Sch Comp Sci & Engn, Wuhan 430205, Peoples R China
[2] Wuhan Inst Technol, Hubei Key Lab Intelligent Robot, Wuhan 430205, Peoples R China
基金
中国国家自然科学基金;
关键词
temporal modeling; spatio-temporal motion; group convolution; spatial attention;
D O I
10.3390/e24030368
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Temporal modeling is the key for action recognition in videos, but traditional 2D CNNs do not capture temporal relationships well. 3D CNNs can achieve good performance, but are computationally intensive and not well practiced on existing devices. Based on these problems, we design a generic and effective module called spatio-temporal motion network (SMNet). SMNet maintains the complexity of 2D and reduces the computational effort of the algorithm while achieving performance comparable to 3D CNNs. SMNet contains a spatio-temporal excitation module (SE) and a motion excitation module (ME). The SE module uses group convolution to fuse temporal information to reduce the number of parameters in the network, and uses spatial attention to extract spatial information. The ME module uses the difference between adjacent frames to extract feature-level motion patterns between adjacent frames, which can effectively encode motion features and help identify actions efficiently. We use ResNet-50 as the backbone network and insert SMNet into the residual blocks to form a simple and effective action network. The experiment results on three datasets, namely Something-Something V1, Something-Something V2, and Kinetics-400, show that it out performs state-of-the-arts motion recognition networks.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] SATNet: A Spatial Attention Based Network for Hyperspectral Image Classification
    Hong, Qingqing
    Zhong, Xinyi
    Chen, Weitong
    Zhang, Zhenghua
    Li, Bin
    Sun, Hao
    Yang, Tianbao
    Tan, Changwei
    REMOTE SENSING, 2022, 14 (22)
  • [42] Research on Human Action Recognition Based on Convolutional Neural Network
    Wang, Peng
    Yang, Yuliang
    Li, Wanchong
    Zhang, Linhao
    Wang, Mengyuan
    Zhang, Xiaobo
    Zhu, Mengyu
    2019 28TH WIRELESS AND OPTICAL COMMUNICATIONS CONFERENCE (WOCC), 2019, : 28 - 32
  • [43] Intelligent Diagnosis of Gearbox Based on Spatial Attention Convolutional Neural Network
    Wang, Pengxin
    Han, Changkun
    Song, Liuyang
    Wang, Huaqing
    Cui, Lingli
    PROCEEDINGS OF 2021 7TH INTERNATIONAL CONFERENCE ON CONDITION MONITORING OF MACHINERY IN NON-STATIONARY OPERATIONS (CMMNO), 2021, : 184 - 189
  • [44] Facial expression recognition using densely connected convolutional neural network and hierarchical spatial attention
    Gan, Chenquan
    Xiao, Junhao
    Wang, Zhangyi
    Zhang, Zufan
    Zhu, Qingyi
    IMAGE AND VISION COMPUTING, 2022, 117
  • [45] Person Re-identification Algorithm Based on Spatial Attention Network
    Hou, Shaoqi
    Liu, Chunhui
    Yin, Kangning
    Yin, Guangqiang
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT III, 2021, 12939 : 117 - 124
  • [46] A Fully Convolutional Network based on Spatial Attention for Saliency Object Detection
    Chen, Kai
    Wang, Yongxiong
    Hu, Chuanfei
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 5707 - 5711
  • [47] Rendezvous in time: an attention-based temporal fusion approach for surgical triplet recognition
    Sharma, Saurav
    Nwoye, Chinedu Innocent
    Mutter, Didier
    Padoy, Nicolas
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (06) : 1053 - 1059
  • [48] Rendezvous in time: an attention-based temporal fusion approach for surgical triplet recognition
    Saurav Sharma
    Chinedu Innocent Nwoye
    Didier Mutter
    Nicolas Padoy
    International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 1053 - 1059
  • [49] The Impact of Scaling Rather Than Shaping Attention: Changes in the Scale of Attention Using Global Motion Inducers Influence Both Spatial and Temporal Acuity
    Lawrence, Rebecca K.
    Edwards, Mark
    Goodhew, Stephanie C.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2020, 46 (03) : 313 - 323
  • [50] Action Recognition Based on Multi-Level Topological Channel Attention of Human Skeleton
    Hu, Kai
    Shen, Chaowen
    Wang, Tianyan
    Shen, Shuai
    Cai, Chengxue
    Huang, Huaming
    Xia, Min
    SENSORS, 2023, 23 (24)