DTA: Double LSTM with Temporal-wise Attention Network for Action Recognition
Cited by: 0
Authors:
Xu, Yangyang [1,2]
Wang, Lei [2,3]
Cheng, Jun [2,3]
Xia, Haiying [1]
Yin, Jianqin [4]
Affiliations:
[1] Guangxi Normal Univ, Guilin, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Virtual Real & Human Interact Te, Shenzhen, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[4] Beijing Univ Posts & Telecommun, Sch Automat, Beijing, Peoples R China
Source:
PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC)
2017
Funding:
National Natural Science Foundation of China;
Keywords:
Action Recognition;
CNN;
LSTM;
Attention Model;
DOI:
Not available
CLC number:
TP301 [Theory, Methods];
Subject classification number:
081202;
Abstract:
In this paper, we propose a new architecture for human action recognition that combines a convolutional neural network (CNN) with two Long Short-Term Memory (LSTM) networks equipped with a temporal-wise attention model. We call this network the Double LSTM with Temporal-wise Attention network (DTA). The features extracted by our model capture both spatial and temporal information. The attention model learns which parts of which frames in a video are relevant to the video label and pays more attention to them. We design a joint optimization layer (JOL) to jointly process the two kinds of features produced by the two LSTMs. The proposed network achieves improved performance on three widely used datasets: the UCF Sports dataset, the UCF11 dataset, and the HMDB51 dataset.
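The temporal-wise attention described in the abstract can be illustrated with a minimal sketch: score each per-frame feature vector (standing in for an LSTM hidden state), softmax the scores over time, and pool frames by those weights. The function names, the dot-product scoring rule, and the toy numbers below are illustrative assumptions, not the paper's exact formulation.

```python
import math

def softmax(xs):
    # numerically stable softmax over a list of scores
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def temporal_attention(frame_feats, score_w):
    """Pool per-frame features with temporal attention (illustrative).

    frame_feats: list of T feature vectors, e.g. LSTM hidden states h_t
    score_w: scoring vector; dot(score_w, h_t) gives frame t's relevance
    Returns (attention weights over frames, pooled video-level feature).
    """
    scores = [sum(w * h for w, h in zip(score_w, feat)) for feat in frame_feats]
    alphas = softmax(scores)  # weights over time, sum to 1
    dim = len(frame_feats[0])
    pooled = [sum(a * feat[d] for a, feat in zip(alphas, frame_feats))
              for d in range(dim)]
    return alphas, pooled

# toy example: 3 frames with 2-dim features; the third frame scores highest
feats = [[0.1, 0.2], [0.5, 0.1], [2.0, 1.5]]
alphas, video_feat = temporal_attention(feats, score_w=[1.0, 1.0])
print(alphas)      # attention concentrates on the most relevant frame
print(video_feat)  # weighted sum of frame features
```

In the full model, `score_w` would be learned jointly with the LSTMs, and the pooled feature would feed the classification layer; here it is fixed only to show the weighting mechanism.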