DTA:Double LSTM with Temporal-wise Attention Network for Action Recognition

被引：0

作者：

Xu, Yangyang ^{[1
,2
]}

Wang, Lei ^{[2
,3
]}

Cheng, Jun ^{[2
,3
]}

Xia, Haiying ^{[1
]}

Yin, Jianqin ^{[4
]}

机构：

[1] Guangxi Normal Univ, Guilin, Peoples R China

[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Virtual Real & Human Interact Te, Shenzhen, Peoples R China

[3] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China

[4] Beijing Univ Posts & Telecommun, Sch Automat, Beijing, Peoples R China

来源：

PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC) | 2017年

基金：

中国国家自然科学基金;

关键词：

Action Recognition; CNN; LSTM; Attention Model;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In this paper, we propose a new architecture for human action recognition by using a convolution neural networks (CNN) and two Long Short-Term Memory(LSTM) networks with temporal-wise attention model. We call this network the Double LSTM with Temporal-wise Attention network (DTA). The features extracted by our model are both spatially and temporally. The attention model can learn which parts in which frames in a video are relevant to the video label and pay more attention on them. We designed a joint optimization layer (JOL) to jointly process two kinds of feature produced by two LSTMs. The proposed networks achieved improved performance on three widely used datasets-the UCF Sports dataset, the UCF11 dataset and the HMDB51 dataset.

引用

页码：1676 / 1680

页数：5

共 50 条

[41] Temporal Segment Connection Network for Action Recognition
Li, Qian
Yang, Wenzhu
Chen, Xiangyang
Yuan, Tongtong
Wang, Yuxia
IEEE ACCESS, 2020, 8 : 179118 - 179127
[42] Global Temporal Difference Network for Action Recognition
Xie, Zhao
Chen, Jiansong
Wu, Kewei
Guo, Dan
Hong, Richang
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7594 - 7606
[43] Human Action Recognition Algorithm Based on Bi-LSTM-Attention Model
Zhu Mingkang
Lu Xianling
LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (15)
[44] Spatio-Temporal Attention Networks for Action Recognition and Detection
Li, Jun
Liu, Xianglong
Zhang, Wenxuan
Zhang, Mingyuan
Song, Jingkuan
Sebe, Nicu
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (11) : 2990 - 3001
[45] 3D-STARNET: Spatial-Temporal Attention Residual Network for Robust Action Recognition
Yang, Jun
Sun, Shulong
Chen, Jiayue
Xie, Haizhen
Wang, Yan
Yang, Zenglong
APPLIED SCIENCES-BASEL, 2024, 14 (16):
[46] Content-Aware Attention Network for Action Recognition
Liu, Ziyi
Wang, Le
Zheng, Nanning
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2018, 2018, 519 : 109 - 120
[47] Differential motion attention network for efficient action recognition
Liu, Caifeng
Gu, Fangjie
VISUAL COMPUTER, 2025, 41 (03) : 1719 - 1731
[48] Self-Attention Pooling-Based Long-Term Temporal Network for Action Recognition
Li, Huifang
Huang, Jingwei
Zhou, Mengchu
Shi, Qisong
Fei, Qing
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (01) : 65 - 77
[49] Human action recognition based on spatial-temporal relational model and LSTM-CNN framework
Senthilkumar, N.
Manimegalai, M.
Karpakam, S.
Ashokkumar, S. R.
Premkumar, M.
MATERIALS TODAY-PROCEEDINGS, 2022, 57 : 2087 - 2091
[50] ResLNet: deep residual LSTM network with longer input for action recognition
Wang, Tian
Li, Jiakun
Wu, Huai-Ning
Li, Ce
Snoussi, Hichem
Wu, Yang
FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (06)

← 1 2 3 4 5 →