ENSEMBLE BASED FEATURE EXTRACTION AND DEEP LEARNING CLASSIFICATION MODEL WITH DEPTH VISION

被引:0
作者
Sinha, Kumari Priyanka [1 ]
Kumar, Prabhat [2 ]
Ghosh, Rajib [2 ]
机构
[1] Nalanda Coll Engn, Dept Comp Sci & Engn, Chandi Bihar, India
[2] Natl Inst Technol Patna, Dept Comp Sci & Engn, Patna, India
关键词
Human activities; improved LTXOR; BoW; Bi-LSTM; Bi-GRU classi-fier; HUMAN ACTIVITY RECOGNITION; WI-FI; ATTENTION; KNOWLEDGE; NETWORK;
D O I
10.31577/cai2023_4_965
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It remains a challenging task to identify human activities from a video sequence or still image due to factors such as backdrop clutter, fractional occlu-sion, and changes in scale, point of view, appearance, and lighting. Different ap-pliances, as well as video surveillance systems, human-computer interfaces, and robots used to study human behavior, require different activity classification sys-tems. A four-stage framework for recognizing human activities is proposed in the paper. As part of the initial stages of pre-processing, video-to-frame con-version and adaptive histogram equalization (AHE) are performed. Additionally, watershed segmentation is performed and, from the segmented images, local tex-ton XOR patterns (LTXOR), motion boundary scale-invariant feature transforms (MoBSIFT) and bag of visual words (BoW) based features are extracted. The Bidirectional gated recurrent unit (Bi-GRU) and the Bidirectional long short-term memory (Bi-LSTM) classifiers are used to detect human activity. In addition, the combined decisions of the Bi-GRU and Bi-LSTM classifiers are further fused, and their accuracy levels are determined. With this Dempster-Shafer theory (DST) technique, it is more likely that the results obtained from the analysis are ac-curate. Various metrics are used to assess the effectiveness of the deployed ap-proach.
引用
收藏
页码:965 / 992
页数:28
相关论文
共 58 条
  • [1] Bag-of-words with aggregated temporal pair-wise word co-occurrence for human action recognition
    Agusti, Pau
    Javier Traver, V.
    Pla, Filiberto
    [J]. PATTERN RECOGNITION LETTERS, 2014, 49 : 224 - 230
  • [2] A new technique for combining multiple classifiers using the Dempster-Shafer theory of evidence
    Al-Ani, M
    Deriche, M
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2002, 17 : 333 - 361
  • [3] Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models
    AlDahoul, Nouar
    Sabri, Aznul Qalid Md
    Mansoor, Ali Mohammed
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
  • [4] [Anonymous], 2011, UCF-ARG Data Set
  • [5] Local texton XOR patterns: A new feature descriptor for content-based image retrieval
    Bala, Anu
    Kaur, Tajinder
    [J]. ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2016, 19 (01): : 101 - 112
  • [6] DGRU based human activity recognition using channel state information
    Bokhari, Syed Mohsin
    Sohaib, Sarmad
    Khan, Ahsan Raza
    Shafi, Muhammad
    Khan, Atta Ur Rehman
    [J]. MEASUREMENT, 2021, 167
  • [7] Focus-of-Attention for Human Activity Recognition from UAVs
    Burghouts, G. J.
    van Eekeren, A. W. M.
    Dijk, J.
    [J]. ELECTRO-OPTICAL AND INFRARED SYSTEMS: TECHNOLOGY AND APPLICATIONS XI, 2014, 9249
  • [8] Human Action Recognition Using Improved Sparse Gaussian Process Latent Variable Model and Hidden Conditional Random Filed
    Cai, Linqin
    Liu, Xiaolin
    Ding, Heen
    Chen, Fuli
    [J]. IEEE ACCESS, 2018, 6 : 20047 - 20057
  • [9] Distilling the Knowledge From Handcrafted Features for Human Activity Recognition
    Chen, Zhenghua
    Zhang, Le
    Cao, Zhiguang
    Guo, Jing
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (10) : 4334 - 4342
  • [10] Robust Human Activity Recognition Using Multimodel Feature-Level Fusion
    Ehatisham-Ul-Haq, Muhammad
    Javed, Ali
    Azam, Muhammad Awais
    Malik, Hafiz M. A.
    Irtaza, Aun
    Lee, Ik Hyun
    Mahmood, Muhammad Tariq
    [J]. IEEE ACCESS, 2019, 7 : 60736 - 60751