ENSEMBLE BASED FEATURE EXTRACTION AND DEEP LEARNING CLASSIFICATION MODEL WITH DEPTH VISION

被引：0

作者：

Sinha, Kumari Priyanka ^{[1
]}

Kumar, Prabhat ^{[2
]}

Ghosh, Rajib ^{[2
]}

机构：

[1] Nalanda Coll Engn, Dept Comp Sci & Engn, Chandi Bihar, India

[2] Natl Inst Technol Patna, Dept Comp Sci & Engn, Patna, India

来源：

COMPUTING AND INFORMATICS | 2023年 / 42卷 / 04期

关键词：

Human activities; improved LTXOR; BoW; Bi-LSTM; Bi-GRU classi-fier; HUMAN ACTIVITY RECOGNITION; WI-FI; ATTENTION; KNOWLEDGE; NETWORK;

D O I：

10.31577/cai2023_4_965

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

It remains a challenging task to identify human activities from a video sequence or still image due to factors such as backdrop clutter, fractional occlu-sion, and changes in scale, point of view, appearance, and lighting. Different ap-pliances, as well as video surveillance systems, human-computer interfaces, and robots used to study human behavior, require different activity classification sys-tems. A four-stage framework for recognizing human activities is proposed in the paper. As part of the initial stages of pre-processing, video-to-frame con-version and adaptive histogram equalization (AHE) are performed. Additionally, watershed segmentation is performed and, from the segmented images, local tex-ton XOR patterns (LTXOR), motion boundary scale-invariant feature transforms (MoBSIFT) and bag of visual words (BoW) based features are extracted. The Bidirectional gated recurrent unit (Bi-GRU) and the Bidirectional long short-term memory (Bi-LSTM) classifiers are used to detect human activity. In addition, the combined decisions of the Bi-GRU and Bi-LSTM classifiers are further fused, and their accuracy levels are determined. With this Dempster-Shafer theory (DST) technique, it is more likely that the results obtained from the analysis are ac-curate. Various metrics are used to assess the effectiveness of the deployed ap-proach.

引用

页码：965 / 992

页数：28

共 58 条

[1] Bag-of-words with aggregated temporal pair-wise word co-occurrence for human action recognition
Agusti, Pau
Javier Traver, V.
Pla, Filiberto
[J]. PATTERN RECOGNITION LETTERS, 2014, 49 : 224 - 230
[2] A new technique for combining multiple classifiers using the Dempster-Shafer theory of evidence
Al-Ani, M
Deriche, M
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2002, 17 : 333 - 361
[3] Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models
AlDahoul, Nouar
Sabri, Aznul Qalid Md
Mansoor, Ali Mohammed
[J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
[4] [Anonymous], 2011, UCF-ARG Data Set
[5] Local texton XOR patterns: A new feature descriptor for content-based image retrieval
Bala, Anu
Kaur, Tajinder
[J]. ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2016, 19 (01): : 101 - 112
[6] DGRU based human activity recognition using channel state information
Bokhari, Syed Mohsin
Sohaib, Sarmad
Khan, Ahsan Raza
Shafi, Muhammad
Khan, Atta Ur Rehman
[J]. MEASUREMENT, 2021, 167
[7] Focus-of-Attention for Human Activity Recognition from UAVs
Burghouts, G. J.
van Eekeren, A. W. M.
Dijk, J.
[J]. ELECTRO-OPTICAL AND INFRARED SYSTEMS: TECHNOLOGY AND APPLICATIONS XI, 2014, 9249
[8] Human Action Recognition Using Improved Sparse Gaussian Process Latent Variable Model and Hidden Conditional Random Filed
Cai, Linqin
Liu, Xiaolin
Ding, Heen
Chen, Fuli
[J]. IEEE ACCESS, 2018, 6 : 20047 - 20057
[9] Distilling the Knowledge From Handcrafted Features for Human Activity Recognition
Chen, Zhenghua
Zhang, Le
Cao, Zhiguang
Guo, Jing
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (10) : 4334 - 4342
[10] Robust Human Activity Recognition Using Multimodel Feature-Level Fusion
Ehatisham-Ul-Haq, Muhammad
Javed, Ali
Azam, Muhammad Awais
Malik, Hafiz M. A.
Irtaza, Aun
Lee, Ik Hyun
Mahmood, Muhammad Tariq
[J]. IEEE ACCESS, 2019, 7 : 60736 - 60751

← 1 2 3 4 5 6 →