Attention-Based Temporal Weighted Convolutional Neural Network for Action Recognition

被引:56
作者
Zang, Jinliang [1 ]
Wang, Le [1 ]
Liu, Ziyi [1 ]
Zhang, Qilin [2 ]
Niu, Zhenxing
Hua, Gang [3 ]
Zheng, Nanning [1 ]
机构
[1] Xi An Jiao Tong Univ, Xian 710049, Shaanxi, Peoples R China
[2] HERE Technol, Chicago, IL 60606 USA
[3] Microsoft Res, Redmond, WA 98052 USA
来源
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2018 | 2018年 / 519卷
基金
中国博士后科学基金;
关键词
Action recognition; Attention model; Convolutional neural networks; Video-level prediction; Temporal weighting;
D O I
10.1007/978-3-319-92007-8_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs). However, effective and efficient methods for incorporation of temporal information into CNNs are still being actively explored in the recent literature. Motivated by the popular recurrent attention models in the research area of natural language processing, we propose the Attention-based Temporal Weighted CNN (ATW), which embeds a visual attention model into a temporal weighted multi-stream CNN. This attention model is simply implemented as temporal weighting yet it effectively boosts the recognition performance of video representations. Besides, each stream in the proposed ATW frame- work is capable of end-to-end training, with both network parameters and temporal weights optimized by stochastic gradient descent (SGD) with back-propagation. Our experiments show that the proposed attention mechanism contributes substantially to the performance gains with the more discriminative snippets by focusing on more relevant video segments.
引用
收藏
页码:97 / 108
页数:12
相关论文
共 50 条
[31]   Spatial-Temporal Dynamic Graph Attention Network for Skeleton-Based Action Recognition [J].
Rahevar, Mrugendrasinh ;
Ganatra, Amit ;
Saba, Tanzila ;
Rehman, Amjad ;
Bahaj, Saeed Ali .
IEEE ACCESS, 2023, 11 :21546-21553
[32]   Temporal Convolutional Network for Gas Concentration Prediction Based on Weighted Loss and Channel Coupling Attention [J].
Li, Shuaiyong ;
Zhang, Sai ;
Zhang, Chao ;
Liu, Liang ;
Zhang, Xuyuntao .
IEEE SENSORS JOURNAL, 2025, 25 (06) :9802-9816
[33]   Temporal Group Deep Network Action Recognition Algorithm Based on Attention Mechanism [J].
Hu Z. ;
Diao P. ;
Zhang R. ;
Li S. ;
Zhao M. .
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (10) :892-900
[34]   Independent Dual Graph Attention Convolutional Network for Skeleton-Based Action Recognition [J].
Huo, Jinze ;
Cai, Haibin ;
Meng, Qinggang .
NEUROCOMPUTING, 2024, 583
[35]   Multi-attention graph convolutional network for skeleton-based action recognition [J].
Chen, Chen ;
Chai, Lin .
PROCEEDINGS OF THE 36TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC 2024, 2024, :6190-6195
[36]   Spatial-temporal multiscale feature optimization based two-stream convolutional neural network for action recognition [J].
Xia, Limin ;
Fu, Weiye .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (08) :11611-11626
[37]   Mini-TKAGCN: a lightweight Graph Convolutional Network via Temporal Kernel Attention for Skeleton-based Action Recognition [J].
Liu, Yanan ;
Dong, Shiqi ;
Zhang, Hao ;
Xu, Dan ;
Li, Haipeng .
THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
[38]   Hyperspectral Band Selection Using Attention-Based Convolutional Neural Networks [J].
Lorenzo, Pablo Ribalta ;
Tulczyjew, Lukasz ;
Marcinkiewicz, Michal ;
Nalepa, Jakub .
IEEE ACCESS, 2020, 8 :42384-42403
[39]   Spatio-Temporal Attention-Based LSTM Networks for 3D Action Recognition and Detection [J].
Song, Sijie ;
Lan, Cuiling ;
Xing, Junliang ;
Zeng, Wenjun ;
Liu, Jiaying .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) :3459-3471
[40]   An Attention-Based Convolutional Neural Network With Spatial Transformer Module for Automated Optical Inspection of Small Objects [J].
Kim, Hyun Yong ;
Yi, Taek Joon ;
Lee, Jong Yun .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74