DTA:Double LSTM with Temporal-wise Attention Network for Action Recognition

被引：0

作者：

Xu, Yangyang ^{[1
,2
]}

Wang, Lei ^{[2
,3
]}

Cheng, Jun ^{[2
,3
]}

Xia, Haiying ^{[1
]}

Yin, Jianqin ^{[4
]}

机构：

[1] Guangxi Normal Univ, Guilin, Peoples R China

[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Virtual Real & Human Interact Te, Shenzhen, Peoples R China

[3] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China

[4] Beijing Univ Posts & Telecommun, Sch Automat, Beijing, Peoples R China

来源：

PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC) | 2017年

基金：

中国国家自然科学基金;

关键词：

Action Recognition; CNN; LSTM; Attention Model;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In this paper, we propose a new architecture for human action recognition by using a convolution neural networks (CNN) and two Long Short-Term Memory(LSTM) networks with temporal-wise attention model. We call this network the Double LSTM with Temporal-wise Attention network (DTA). The features extracted by our model are both spatially and temporally. The attention model can learn which parts in which frames in a video are relevant to the video label and pay more attention on them. We designed a joint optimization layer (JOL) to jointly process two kinds of feature produced by two LSTMs. The proposed networks achieved improved performance on three widely used datasets-the UCF Sports dataset, the UCF11 dataset and the HMDB51 dataset.

引用

页码：1676 / 1680

页数：5

共 50 条

[21] EEG-based emotion recognition via capsule network with channel-wise attention and LSTM models
Lina Deng
Xiaoliang Wang
Frank Jiang
Robin Doss
CCF Transactions on Pervasive Computing and Interaction, 2021, 3 : 425 - 435
[22] EEG-based emotion recognition via capsule network with channel-wise attention and LSTM models
Deng, Lina
Wang, Xiaoliang
Jiang, Frank
Doss, Robin
CCF TRANSACTIONS ON PERVASIVE COMPUTING AND INTERACTION, 2021, 3 (04) : 425 - 435
[23] SPATIO-TEMPORAL SLOWFAST SELF-ATTENTION NETWORK FOR ACTION RECOGNITION
Kim, Myeongjun
Kim, Taehun
Kim, Daijin
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2206 - 2210
[24] Temporal Group Deep Network Action Recognition Algorithm Based on Attention Mechanism
Hu Z.
Diao P.
Zhang R.
Li S.
Zhao M.
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (10): : 892 - 900
[25] Residual attention fusion network for video action recognition
Li, Ao
Yi, Yang
Liang, Daan
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
[26] Spatial-Temporal Attention for Action Recognition
Sun, Dengdi
Wu, Hanqing
Ding, Zhuanlian
Luo, Bin
Tang, Jin
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 854 - 864
[27] IMPROVING HUMAN ACTION RECOGNITION BY TEMPORAL ATTENTION
Liu, Zhikang
Tian, Ye
Wang, Zilei
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 870 - 874
[28] Spatio-Temporal Attention-Based LSTM Networks for 3D Action Recognition and Detection
Song, Sijie
Lan, Cuiling
Xing, Junliang
Zeng, Wenjun
Liu, Jiaying
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) : 3459 - 3471
[29] Spatial–Temporal gated graph attention network for skeleton-based action recognition
Mrugendrasinh Rahevar
Amit Ganatra
Pattern Analysis and Applications, 2023, 26 (3) : 929 - 939
[30] R-STAN: Residual Spatial-Temporal Attention Network for Action Recognition
Liu, Quanle
Che, Xiangjiu
Bie, Mei
IEEE ACCESS, 2019, 7 : 82246 - 82255

← 1 2 3 4 5 →