DTA:Double LSTM with Temporal-wise Attention Network for Action Recognition

被引:0
|
作者
Xu, Yangyang [1 ,2 ]
Wang, Lei [2 ,3 ]
Cheng, Jun [2 ,3 ]
Xia, Haiying [1 ]
Yin, Jianqin [4 ]
机构
[1] Guangxi Normal Univ, Guilin, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Virtual Real & Human Interact Te, Shenzhen, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[4] Beijing Univ Posts & Telecommun, Sch Automat, Beijing, Peoples R China
来源
PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC) | 2017年
基金
中国国家自然科学基金;
关键词
Action Recognition; CNN; LSTM; Attention Model;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we propose a new architecture for human action recognition by using a convolution neural networks (CNN) and two Long Short-Term Memory(LSTM) networks with temporal-wise attention model. We call this network the Double LSTM with Temporal-wise Attention network (DTA). The features extracted by our model are both spatially and temporally. The attention model can learn which parts in which frames in a video are relevant to the video label and pay more attention on them. We designed a joint optimization layer (JOL) to jointly process two kinds of feature produced by two LSTMs. The proposed networks achieved improved performance on three widely used datasets-the UCF Sports dataset, the UCF11 dataset and the HMDB51 dataset.
引用
收藏
页码:1676 / 1680
页数:5
相关论文
共 50 条
  • [31] Graph transformer network with temporal kernel attention for skeleton-based action recognition
    Liu, Yanan
    Zhang, Hao
    Xu, Dan
    He, Kangjian
    KNOWLEDGE-BASED SYSTEMS, 2022, 240
  • [32] Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network with Trust Gates
    Liu, Jun
    Shahroudy, Amir
    Xu, Dong
    Kot, Alex C.
    Wang, Gang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (12) : 3007 - 3021
  • [33] Basketball action recognition based on the combination of YOLO and a deep fuzzy LSTM network
    Khobdeh, Soroush Babaee
    Yamaghani, Mohammad Reza
    Sareshkeh, Siavash Khodaparast
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (03) : 3528 - 3553
  • [34] Joint spatial-temporal attention for action recognition
    Yu, Tingzhao
    Guo, Chaoxu
    Wang, Lingfeng
    Gu, Huxiang
    Xiang, Shiming
    Pan, Chunhong
    PATTERN RECOGNITION LETTERS, 2018, 112 : 226 - 233
  • [35] Dual attention convolutional network for action recognition
    Li, Xiaoqiang
    Xie, Miao
    Zhang, Yin
    Ding, Guangtai
    Tong, Weiqin
    IET IMAGE PROCESSING, 2020, 14 (06) : 1059 - 1065
  • [36] Basketball action recognition based on the combination of YOLO and a deep fuzzy LSTM network
    Soroush Babaee Khobdeh
    Mohammad Reza Yamaghani
    Siavash Khodaparast Sareshkeh
    The Journal of Supercomputing, 2024, 80 : 3528 - 3553
  • [37] Human Action Recognition Using Key-Frame Attention-Based LSTM Networks
    Yang, Changxuan
    Mei, Feng
    Zang, Tuo
    Tu, Jianfeng
    Jiang, Nan
    Liu, Lingfeng
    ELECTRONICS, 2023, 12 (12)
  • [38] Spatial-Temporal Dynamic Graph Attention Network for Skeleton-Based Action Recognition
    Rahevar, Mrugendrasinh
    Ganatra, Amit
    Saba, Tanzila
    Rehman, Amjad
    Bahaj, Saeed Ali
    IEEE ACCESS, 2023, 11 : 21546 - 21553
  • [39] Spatial-Temporal gated graph attention network for skeleton-based action recognition
    Rahevar, Mrugendrasinh
    Ganatra, Amit
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 929 - 939
  • [40] Weakly-supervised temporal attention 3D network for human action recognition
    Kim, Jonghyun
    Li, Gen
    Yun, Inyong
    Jung, Cheolkon
    Kim, Joongkyu
    PATTERN RECOGNITION, 2021, 119