DTA:Double LSTM with Temporal-wise Attention Network for Action Recognition

被引:0
|
作者
Xu, Yangyang [1 ,2 ]
Wang, Lei [2 ,3 ]
Cheng, Jun [2 ,3 ]
Xia, Haiying [1 ]
Yin, Jianqin [4 ]
机构
[1] Guangxi Normal Univ, Guilin, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Virtual Real & Human Interact Te, Shenzhen, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[4] Beijing Univ Posts & Telecommun, Sch Automat, Beijing, Peoples R China
来源
PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC) | 2017年
基金
中国国家自然科学基金;
关键词
Action Recognition; CNN; LSTM; Attention Model;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we propose a new architecture for human action recognition by using a convolution neural networks (CNN) and two Long Short-Term Memory(LSTM) networks with temporal-wise attention model. We call this network the Double LSTM with Temporal-wise Attention network (DTA). The features extracted by our model are both spatially and temporally. The attention model can learn which parts in which frames in a video are relevant to the video label and pay more attention on them. We designed a joint optimization layer (JOL) to jointly process two kinds of feature produced by two LSTMs. The proposed networks achieved improved performance on three widely used datasets-the UCF Sports dataset, the UCF11 dataset and the HMDB51 dataset.
引用
收藏
页码:1676 / 1680
页数:5
相关论文
共 50 条
  • [21] EEG-based emotion recognition via capsule network with channel-wise attention and LSTM models
    Lina Deng
    Xiaoliang Wang
    Frank Jiang
    Robin Doss
    CCF Transactions on Pervasive Computing and Interaction, 2021, 3 : 425 - 435
  • [22] EEG-based emotion recognition via capsule network with channel-wise attention and LSTM models
    Deng, Lina
    Wang, Xiaoliang
    Jiang, Frank
    Doss, Robin
    CCF TRANSACTIONS ON PERVASIVE COMPUTING AND INTERACTION, 2021, 3 (04) : 425 - 435
  • [23] SPATIO-TEMPORAL SLOWFAST SELF-ATTENTION NETWORK FOR ACTION RECOGNITION
    Kim, Myeongjun
    Kim, Taehun
    Kim, Daijin
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2206 - 2210
  • [24] Temporal Group Deep Network Action Recognition Algorithm Based on Attention Mechanism
    Hu Z.
    Diao P.
    Zhang R.
    Li S.
    Zhao M.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (10): : 892 - 900
  • [25] Residual attention fusion network for video action recognition
    Li, Ao
    Yi, Yang
    Liang, Daan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [26] Spatial-Temporal Attention for Action Recognition
    Sun, Dengdi
    Wu, Hanqing
    Ding, Zhuanlian
    Luo, Bin
    Tang, Jin
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 854 - 864
  • [27] IMPROVING HUMAN ACTION RECOGNITION BY TEMPORAL ATTENTION
    Liu, Zhikang
    Tian, Ye
    Wang, Zilei
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 870 - 874
  • [28] Spatio-Temporal Attention-Based LSTM Networks for 3D Action Recognition and Detection
    Song, Sijie
    Lan, Cuiling
    Xing, Junliang
    Zeng, Wenjun
    Liu, Jiaying
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) : 3459 - 3471
  • [29] Spatial–Temporal gated graph attention network for skeleton-based action recognition
    Mrugendrasinh Rahevar
    Amit Ganatra
    Pattern Analysis and Applications, 2023, 26 (3) : 929 - 939
  • [30] R-STAN: Residual Spatial-Temporal Attention Network for Action Recognition
    Liu, Quanle
    Che, Xiangjiu
    Bie, Mei
    IEEE ACCESS, 2019, 7 : 82246 - 82255