DTA: Double LSTM with Temporal-wise Attention Network for Action Recognition

Cited by: 0
Authors
Xu, Yangyang [1, 2]
Wang, Lei [2, 3]
Cheng, Jun [2, 3]
Xia, Haiying [1]
Yin, Jianqin [4]
Affiliations
[1] Guangxi Normal Univ, Guilin, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Virtual Real & Human Interact Te, Shenzhen, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[4] Beijing Univ Posts & Telecommun, Sch Automat, Beijing, Peoples R China
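Source
PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC) | 2017
Funding
National Natural Science Foundation of China
Keywords
Action Recognition; CNN; LSTM; Attention Model
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods]
Subject classification code
081202
Abstract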
In this paper, we propose a new architecture for human action recognition that uses a convolutional neural network (CNN) and two Long Short-Term Memory (LSTM) networks with a temporal-wise attention model. We call this network the Double LSTM with Temporal-wise Attention network (DTA). The features extracted by our model are both spatial and temporal. The attention model can learn which parts of which frames in a video are relevant to the video label and pay more attention to them. We designed a joint optimization layer (JOL) to jointly process the two kinds of features produced by the two LSTMs. The proposed network achieved improved performance on three widely used datasets: the UCF Sports dataset, the UCF11 dataset and the HMDB51 dataset.
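The idea of temporal-wise attention mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how relevance is scored, so the dot-product scoring, the `query` vector, and the function names below are assumptions chosen for illustration only.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def temporal_attention(frame_features, query):
    """Weight per-frame feature vectors by their relevance to a query.

    frame_features: one feature vector per frame (hypothetical input,
                    e.g. what a per-frame encoder might emit).
    query:          a vector standing in for "what matters for the label"
                    (assumed here; in a trained model it would be learned).
    """
    # Relevance score per frame: dot product with the query (an assumption).
    scores = [sum(f * q for f, q in zip(feat, query)) for feat in frame_features]
    weights = softmax(scores)
    dim = len(frame_features[0])
    # Attention-weighted sum over time gives one pooled video descriptor.
    pooled = [
        sum(w * feat[d] for w, feat in zip(weights, frame_features))
        for d in range(dim)
    ]
    return weights, pooled


# Three made-up frames with 2-dimensional features; the second is the
# most aligned with the query, so it should receive the largest weight.
frames = [[0.1, 0.0], [2.0, 1.0], [0.2, 0.1]]
query = [1.0, 1.0]
weights, pooled = temporal_attention(frames, query)
```

Frames whose features align with the query receive larger weights, so they dominate the pooled descriptor; this is the sense in which an attention model can "pay more attention" to relevant frames.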
Pages: 1676-1680
Number of pages: 5