GMNet: an action recognition network with global motion representation

被引：0

作者：

Mingwei Liu

Yi Zhang

机构：

[1] Sichuan University,Department of Computer Science

来源：

International Journal of Machine Learning and Cybernetics | 2023年 / 14卷

关键词：

Action recognition; Deep learning; Spatio-temporal convolution; Motion feature representation;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In recent years, an astonishing progress has been made in action recognition. However, the traditional spatio-temporal convolution kernels cannot learn sufficient motion information, which is the key step in action recognition. Therefore, a more effective motion representation approach is required to reason the motion cues in videos. In this light, we propose GMNet, an action recognition network with global motion representation to fulfill such task. It includes a short-term motion feature extraction module and a motion feature aggregation module. The former one is capable of capturing local motion features from adjacent frames, while the latter one excels at aggregating the above features to yield global motion representations. GMNet is easily compatible to any mainstream backbones to realize end-to-end training without additional supervision. Extensive experiments have been carried out on popular benchmarks (Something-Something V1 & V2, Diving-48, Jester and Kinetics 400) to testify its effectiveness. It turns out that GMNet surpasses most of the state-of-the-art methods.

引用

页码：1683 / 1693

页数：10

共 50 条

[1] GMNet: an action recognition network with global motion representation
Liu, Mingwei
Zhang, Yi
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (05) : 1683 - 1693
[2] UNSUPERVISED MOTION REPRESENTATION ENHANCED NETWORK FOR ACTION RECOGNITION
Yang, Xiaohang
Kong, Lingtong
Yang, Jie
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2445 - 2449
[3] A spatiotemporal and motion information extraction network for action recognition
Wang, Wei
Wang, Xianmin
Zhou, Mingliang
Wei, Xuekai
Li, Jing
Ren, Xiaojun
Zong, Xuemei
WIRELESS NETWORKS, 2024, 30 (06) : 5389 - 5405
[4] Action recognition and tracking via deep representation extraction and motion bases learning
Li, Hao-Ting
Liu, Yung-Pin
Chang, Yun-Kai
Chiang, Chen-Kuo
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (09) : 11845 - 11864
[5] Action recognition and tracking via deep representation extraction and motion bases learning
Hao-Ting Li
Yung-Pin Liu
Yun-Kai Chang
Chen-Kuo Chiang
Multimedia Tools and Applications, 2022, 81 : 11845 - 11864
[6] Motion Feature Network: Fixed Motion Filter for Action Recognition
Lee, Myunggi
Lee, Seungeui
Son, Sungjoon
Park, Gyutae
Kwak, Nojun
COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 392 - 408
[7] Global Temporal Difference Network for Action Recognition
Xie, Zhao
Chen, Jiansong
Wu, Kewei
Guo, Dan
Hong, Richang
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7594 - 7606
[8] Global Temporal Representation Based CNNs for Infrared Action Recognition
Liu, Yang
Lu, Zhaoyang
Li, Jing
Yang, Tao
Yao, Chao
IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (06) : 848 - 852
[9] A motion-aware ConvLSTM network for action recognition
Mahshid Majd
Reza Safabakhsh
Applied Intelligence, 2019, 49 : 2515 - 2521
[10] Differential motion attention network for efficient action recognition
Liu, Caifeng
Gu, Fangjie
VISUAL COMPUTER, 2025, 41 (03): : 1719 - 1731

← 1 2 3 4 5 →