Optimized hybrid RNN model for human activity recognition in untrimmed video

被引:4
作者
Deotale, Disha [1 ]
Verma, Madhushi [2 ]
Suresh, Perumbure [3 ]
Kotecha, Ketan [4 ]
机构
[1] GH Raisoni Coll Engn & Management, Dept Artificial Intelligence, Pune, Maharashtra, India
[2] Bennett Univ, Sch Comp Sci Engn & Technol, Greater Noida, India
[3] Muthoot Inst Technol & Sci, CIDRIE, Kochi, Kerala, India
[4] Symbiosis Int Univ, Symbiosis Ctr Appl Artificial Intelligence, Pune, Maharashtra, India
关键词
untrimmed; convolutional neural networks; recurrent neural network; long short-term memory; gated recurrent unit; human action recognition;
D O I
10.1117/1.JEI.31.5.051409
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Human activity recognition is a field of video processing that requires restricted temporal analysis of video sequences for estimating the existence of different human actions. Designing an efficient human activity model requires credible implementations of keyframe extraction, preprocessing, feature extraction and selection, classification, and pattern recognition methods. In the real-time video, sequences are untrimmed and do not have any activity endpoints for effective recognition. Thus, we propose a hybrid gated recurrent unit and long short-term memory-based recurrent neural network model for high-efficiency human action recognition in untrimmed video datasets. The proposed model is tested on the TRECVID dataset, along with other online datasets, and is observed to have an accuracy of over 91% for untrimmed video-based activity recognition. This accuracy is compared with various state-of-the-art models and is found to be higher when evaluated on multiple datasets. (c) 2022 SPIE and IS&T
引用
收藏
页数:17
相关论文
共 42 条
  • [1] A Comparative Analysis of Hybrid Deep Learning Models for Human Activity Recognition
    Abbaspour, Saedeh
    Fotouhi, Faranak
    Sedaghatbaf, Ali
    Fotouhi, Hossein
    Vahabi, Maryam
    Linden, Maria
    [J]. SENSORS, 2020, 20 (19) : 1 - 14
  • [2] Aly CA, 2019, IEEE I C SIGNAL IMAG, P329, DOI [10.1109/ICSIPA45851.2019.8977792, 10.1109/icsipa45851.2019.8977792]
  • [3] [Anonymous], 2017, About us
  • [4] [Anonymous], 2021, INT J RECENT TECHNOL, P233, DOI DOI 10.35940/IJRTE.A5896.0510121
  • [5] Video representation by dense trajectories motion map applied to human activity recognition
    Arif S.
    Ul-Hassan T.
    Hussain F.
    Wang J.
    Fei Z.
    [J]. International Journal of Computers and Applications, 2020, 42 (05) : 474 - 484
  • [6] Video Activity Recognition With Varying Rhythms
    Ayhan, Bulent
    Kwan, Chiman
    Budavari, Bence
    Larkin, Jude
    Gribben, David
    Li, Baoxin
    [J]. IEEE ACCESS, 2020, 8 : 191997 - 192008
  • [7] Vision-based human activity recognition: a survey
    Beddiar, Djamila Romaissa
    Nini, Brahim
    Sabokrou, Mohammad
    Hadid, Abdenour
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (41-42) : 30509 - 30555
  • [8] Gradient local auto-correlation features for depth human action recognition
    Bulbul, Mohammad Farhad
    Ali, Hazrat
    [J]. SN APPLIED SCIENCES, 2021, 3 (05):
  • [9] A Shot boundary Detection Technique based on Visual Colour Information
    Chakraborty, Saptarshi
    Thounaojam, Dalton Meitei
    Sinha, Nidul
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (03) : 4007 - 4022
  • [10] View-Invariant Deep Architecture for Human Action Recognition Using Two-Stream Motion and Shape Temporal Dynamics
    Dhiman, Chhavi
    Vishwakarma, Dinesh Kumar
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 3835 - 3844