Residual deep gated recurrent unit-based attention framework for human activity recognition by exploiting dilated features

被引:1
|
作者
Pandey, Ajeet [1 ]
Kumar, Piyush [1 ]
机构
[1] Natl Inst Technol Patna, Comp Sci & Engn, Patna 800005, Bihar, India
来源
VISUAL COMPUTER | 2024年 / 40卷 / 12期
关键词
Dilated convolutional neural network; Gated recurrent unit; Attention mechanism; Action recognition; Residual mechanism; NETWORK; LSTM;
D O I
10.1007/s00371-024-03266-w
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Human activity recognition (HAR) in video streams becomes a thriving research area in computer vision and pattern recognition. Activity recognition in actual video is quite demanding due to a lack of data with respect to motion, way or style, and cluttered background. The current HAR approaches primarily apply pre-trained weights of various deep learning (DL) models for the apparent description of frames during the learning phase. It impacts the assessment of feature discrepancies, like the separation between both the temporal and visual cues. To address this issue, a residual deep gated recurrent unit (RD-GRU)-enabled attention framework with a dilated convolutional neural network (DiCNN) is introduced in this article. This approach particularly targets potential information in the input video frame to recognize the distinct activities in the videos. The DiCNN network is used to capture the crucial, unique features. In this network, the skip connection segment is employed with DiCNN to update the information that retains more knowledge than a shallow layer. Moreover, these features are fed into an attention module to capture the added high-level discriminative action associated with patterns and signs. The attention mechanism is followed by an RD-GRU to learn the long video sequences in order to enhance the performance. The performance metrics, namely accuracy, precision, recall, and f1-score, are used to evaluate the performance of the introduced model on four diverse benchmark datasets: UCF11, UCF Sports, JHMDB, and THUMOS. On these datasets it achieves an accuracy of 98.54%, 99.31%, 82.47%, and 95.23%, respectively. This illustrates the validity of the proposed work compared with state-of-the-art (SOTA) methods.
引用
收藏
页码:8693 / 8712
页数:20
相关论文
共 50 条
  • [1] SDIGRU: Spatial and Deep Features Integration Using Multilayer Gated Recurrent Unit for Human Activity Recognition
    Ahmad, Tariq
    Wu, Jinsong
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) : 973 - 985
  • [2] Attention-Based Convolutional Neural Network and Bidirectional Gated Recurrent Unit for Human Activity Recognition
    Tao, Shuai
    Zhao, Zhiqiang
    Qin, Jing
    Ji, Changqing
    Wang, Zumin
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1128 - 1134
  • [3] Attention-Based Gated Recurrent Unit for Gesture Recognition
    Khodabandelou, Ghazaleh
    Jung, Pyeong-Gook
    Amirat, Yacine
    Mohammed, Samer
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, 18 (02) : 495 - 507
  • [4] A customized residual neural network and bi-directional gated recurrent unit-based automatic speech recognition model
    Reza, Selim
    Ferreira, Marta Campos
    Machado, J. J. M.
    Tavares, Joao Manuel R. S.
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 215
  • [5] Human Activity Recognition Based on Deep-Temporal Learning Using Convolution Neural Networks Features and Bidirectional Gated Recurrent Unit With Features Selection
    Ahmad, Tariq
    Wu, Jinsong
    Alwageed, Hathal Salamah
    Khan, Faheem
    Khan, Jawad
    Lee, Youngmoon
    IEEE ACCESS, 2023, 11 : 33148 - 33159
  • [6] A Novel CNN, Bidirectional Long-Short Term Memory, and Gated Recurrent Unit-Based Hybrid Approach for Human Activity Recognition
    Thakur, Narina
    Singh, Sunil K.
    Gupta, Akash
    Jain, Kunal
    Jain, Rachna
    Perakovic, Dragan
    Nedjah, Nadia
    Rafsanjani, Marjan Kuchaki
    INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI, 2022, 14 (01):
  • [7] Deep Gated Recurrent Unit-Based 3D Localization for UWB Systems
    Nguyen, Doan Tan Anh
    Joung, Jingon
    Kang, Xin
    IEEE ACCESS, 2021, 9 (09): : 68798 - 68813
  • [8] A model for electroencephalogram emotion recognition: Residual block-gated recurrent unit with attention mechanism
    Wang, Yujie
    Zhang, Xiu
    Zhang, Xin
    Sun, Baiwei
    Xu, Bingyue
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2024, 95 (08):
  • [9] Real Time Human Activity Recognition Using Convolutional Neural Network and Deep Gated Recurrent Unit
    Fajar, Rasyid
    Suciati, Nanik
    Navastara, Dini Adni
    2020 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICELTICS 2020), 2020, : 58 - 63
  • [10] Attention Aware Bidirectional Gated Recurrent Unit Based Framework for Sentiment Analysis
    Tian, Zhengxi
    Rong, Wenge
    Shi, Libin
    Liu, Jingshuang
    Xiong, Zhang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2018), PT I, 2018, 11061 : 67 - 78