Human Action Recognition Using Key-Frame Attention-Based LSTM Networks

Cited by: 1
Authors
Yang, Changxuan [1]
Mei, Feng [1]
Zang, Tuo [1]
Tu, Jianfeng [1]
Jiang, Nan [1]
Liu, Lingfeng [1, 2]
Affiliations
[1] East China JiaoTong Univ, Sch Informat Engn, Nanchang 330013, Peoples R China
[2] Jiangxi Minxuan Intelligent Technol Co Ltd, Nanchang 330029, Peoples R China
Keywords
action recognition; ARMA; attention mechanism; key-frame extraction; K-means; LSTM; REPRESENTATION
DOI
10.3390/electronics12122622
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Human action recognition is a classical problem in computer vision and machine learning, and recognising human actions both effectively and efficiently remains a concern for researchers. In this paper, we propose a key-frame-based approach to human action recognition. First, we design a key-frame attention-based LSTM network (KF-LSTM) that combines an attention mechanism with an LSTM and recognises human action sequences by assigning different weights to frames, so that key frames receive more attention. In addition, we design a new key-frame extraction method that combines an automatic segmentation model based on the autoregressive moving average (ARMA) algorithm with the K-means clustering algorithm. This method avoids inter-frame confusion in the temporal sequence of key frames of different actions, so that the subsequent human action recognition task proceeds smoothly. The dataset used in the experiments was acquired with an IMU sensor-based motion capture device; we extracted the motion features of each joint separately using a manual method and then performed collective inference.
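The idea of weighting frames so that key frames receive more attention can be illustrated with a minimal sketch. This is not the KF-LSTM formulation from the paper, which the abstract does not specify. It assumes that per-frame hidden vectors (for example, LSTM outputs) are already available, scores each frame by a dot product with a query vector, and adds a fixed bonus to frames flagged as key frames before a softmax. The names `attention_pool`, `query`, and `key_bonus` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(hidden_states, query, key_frames, key_bonus=1.0):
    """Pool per-frame hidden vectors into one context vector.

    hidden_states: list of per-frame vectors (assumed to come from an LSTM).
    query:         scoring vector (hypothetical; would be learned in practice).
    key_frames:    set of frame indices flagged as key frames.
    key_bonus:     additive score bonus for key frames (an assumption).
    """
    scores = []
    for t, h in enumerate(hidden_states):
        score = sum(hi * qi for hi, qi in zip(h, query))
        if t in key_frames:
            score += key_bonus  # key frames get a higher pre-softmax score
        scores.append(score)
    weights = softmax(scores)
    dim = len(hidden_states[0])
    # Context vector: attention-weighted sum of the frame vectors.
    context = [
        sum(w * h[d] for w, h in zip(weights, hidden_states))
        for d in range(dim)
    ]
    return weights, context


# Toy input: four frames with 2-dimensional hidden vectors.
hidden_states = [[0.1, 0.2], [0.4, 0.1], [0.3, 0.3], [0.2, 0.5]]
query = [0.0, 0.0]  # zero query, so only the key-frame bonus affects the scores
weights, context = attention_pool(hidden_states, query, key_frames={1, 3})
```

With a zero query, the two flagged frames share the largest weights and the other two frames share the smallest, and the weights still sum to one. In a trained network the scoring function would be learned, and the key-frame indices would come from the extraction step rather than being supplied by hand.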
Pages: 20