Human activity recognition from uav videos using an optimized hybrid deep learning model

被引：2

作者：

Sinha, Kumari Priyanka ^{[1
]}

Kumar, Prabhat ^{[2
]}

机构：

[1] Nalanda Coll Engn, Dept CSE, Chandi, India

[2] Natl Inst Technol Patna, Dept CSE, Patna 800005, Bihar, India

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2023年 / 83卷 / 17期

关键词：

Human Activity Recognition (HAR); Contrast enhancement; Bag of Visual Words (BoVW); Improved LGTP features; Convolutional neural network; Blue Monkey Standardized Aquila Optimization (BMSAO); NETWORK; FEATURES; SENSORS;

D O I：

10.1007/s11042-023-17289-3

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Human activity recognition (HAR) is an important research area in both machine learning and human-computer interactions. Unfortunately, it remains an extremely difficult process owing to unsolvable issues, such as sensor movement, sensor positioning, crowded background, and inherent diversity in task performances by distinct humans. In this study, we developed an ensemble of classification models for the HAR. The proposed HAR has four working phases-preprocessing, segmentation, feature extraction, and classification. The pre-processing phase includes processes such as frame conversion and contrast enhancement. We developed an improved balanced iterative reducing and clustering utilising hierarchies (BIRCH) algorithm, that provides efficient segmentation by utilizing only minimal resources. These segmented images are subjected to feature extraction, in which grey level co-occurrence matrix (GLCM) features, and improved local gradient threshold pattern (LGTP) features are extracted along with conventional bag of visual words (BoVW) to provide better results. An ensemble classification model with classifiers such as Bi-GRU, CNN, and LSTM was developed in this study to provide an accurate classification. To enhance the performance of the proposed model, we developed a blue monkey standardized aquila optimization (BMSAO) approach. Conventional techniques are contrasted with the proposed framework. The proposed mechanism was found to have higher efficiency in HAR after it was experimentally evaluated.

引用

页码：51669 / 51698

页数：30

共 50 条

[1] Aquila Optimizer: A novel meta-heuristic optimization algorithm
Abualigah, Laith
Yousri, Dalia
Abd Elaziz, Mohamed
Ewees, Ahmed A.
Al-qaness, Mohammed A. A.
Gandomi, Amir H.
[J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 157 (157)
[2] Bag-of-words with aggregated temporal pair-wise word co-occurrence for human action recognition
Agusti, Pau
Javier Traver, V.
Pla, Filiberto
[J]. PATTERN RECOGNITION LETTERS, 2014, 49 : 224 - 230
[3] Ahmed Sohail, 2016, 2016 13th International Conference on Service Systems and Service Management (ICSSSM), P1, DOI 10.1109/ICSSSM.2016.7538459
[4] Facial Expression Recognition Using Local Transitional Pattern on Gabor Filtered Facial Images
Ahsan, Tanveer
Jabid, Taskeed
Chong, Ui-Pil
[J]. IETE TECHNICAL REVIEW, 2013, 30 (01) : 47 - 52
[5] A new technique for combining multiple classifiers using the Dempster-Shafer theory of evidence
Al-Ani, M
Deriche, M
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2002, 17 : 333 - 361
[6] Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models
AlDahoul, Nouar
Sabri, Aznul Qalid Md
Mansoor, Ali Mohammed
[J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
[7] CHARM-Deep: Continuous Human Activity Recognition Model Based on Deep Neural Network Using IMU Sensors of Smartwatch
Ashry, Sara
Ogawa, Tetsuji
Gomaa, Walid
[J]. IEEE SENSORS JOURNAL, 2020, 20 (15) : 8757 - 8770
[8] Human action recognition with bag of visual words using different machine learning methods and hyperparameter optimization
Aslan, Muhammet Fatih
Durdu, Akif
Sabanci, Kadir
[J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (12) : 8585 - 8597
[9] Local texton XOR patterns: A new feature descriptor for content-based image retrieval
Bala, Anu
Kaur, Tajinder
[J]. ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2016, 19 (01): : 101 - 112
[10] Vision-based human activity recognition: a survey
Beddiar, Djamila Romaissa
Nini, Brahim
Sabokrou, Mohammad
Hadid, Abdenour
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (41-42) : 30509 - 30555

← 1 2 3 4 5 →