GMNet: an action recognition network with global motion representation

被引:0
|
作者
Mingwei Liu
Yi Zhang
机构
[1] Sichuan University,Department of Computer Science
来源
International Journal of Machine Learning and Cybernetics | 2023年 / 14卷
关键词
Action recognition; Deep learning; Spatio-temporal convolution; Motion feature representation;
D O I
暂无
中图分类号
学科分类号
摘要
In recent years, an astonishing progress has been made in action recognition. However, the traditional spatio-temporal convolution kernels cannot learn sufficient motion information, which is the key step in action recognition. Therefore, a more effective motion representation approach is required to reason the motion cues in videos. In this light, we propose GMNet, an action recognition network with global motion representation to fulfill such task. It includes a short-term motion feature extraction module and a motion feature aggregation module. The former one is capable of capturing local motion features from adjacent frames, while the latter one excels at aggregating the above features to yield global motion representations. GMNet is easily compatible to any mainstream backbones to realize end-to-end training without additional supervision. Extensive experiments have been carried out on popular benchmarks (Something-Something V1 & V2, Diving-48, Jester and Kinetics 400) to testify its effectiveness. It turns out that GMNet surpasses most of the state-of-the-art methods.
引用
收藏
页码:1683 / 1693
页数:10
相关论文
共 50 条
  • [1] GMNet: an action recognition network with global motion representation
    Liu, Mingwei
    Zhang, Yi
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (05) : 1683 - 1693
  • [2] UNSUPERVISED MOTION REPRESENTATION ENHANCED NETWORK FOR ACTION RECOGNITION
    Yang, Xiaohang
    Kong, Lingtong
    Yang, Jie
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2445 - 2449
  • [3] A spatiotemporal and motion information extraction network for action recognition
    Wang, Wei
    Wang, Xianmin
    Zhou, Mingliang
    Wei, Xuekai
    Li, Jing
    Ren, Xiaojun
    Zong, Xuemei
    WIRELESS NETWORKS, 2024, 30 (06) : 5389 - 5405
  • [4] Action recognition and tracking via deep representation extraction and motion bases learning
    Li, Hao-Ting
    Liu, Yung-Pin
    Chang, Yun-Kai
    Chiang, Chen-Kuo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (09) : 11845 - 11864
  • [5] Action recognition and tracking via deep representation extraction and motion bases learning
    Hao-Ting Li
    Yung-Pin Liu
    Yun-Kai Chang
    Chen-Kuo Chiang
    Multimedia Tools and Applications, 2022, 81 : 11845 - 11864
  • [6] Motion Feature Network: Fixed Motion Filter for Action Recognition
    Lee, Myunggi
    Lee, Seungeui
    Son, Sungjoon
    Park, Gyutae
    Kwak, Nojun
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 392 - 408
  • [7] Global Temporal Difference Network for Action Recognition
    Xie, Zhao
    Chen, Jiansong
    Wu, Kewei
    Guo, Dan
    Hong, Richang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7594 - 7606
  • [8] Global Temporal Representation Based CNNs for Infrared Action Recognition
    Liu, Yang
    Lu, Zhaoyang
    Li, Jing
    Yang, Tao
    Yao, Chao
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (06) : 848 - 852
  • [9] A motion-aware ConvLSTM network for action recognition
    Mahshid Majd
    Reza Safabakhsh
    Applied Intelligence, 2019, 49 : 2515 - 2521
  • [10] Differential motion attention network for efficient action recognition
    Liu, Caifeng
    Gu, Fangjie
    VISUAL COMPUTER, 2025, 41 (03): : 1719 - 1731