Human activity recognition from uav videos using an optimized hybrid deep learning model

被引:2
作者
Sinha, Kumari Priyanka [1 ]
Kumar, Prabhat [2 ]
机构
[1] Nalanda Coll Engn, Dept CSE, Chandi, India
[2] Natl Inst Technol Patna, Dept CSE, Patna 800005, Bihar, India
关键词
Human Activity Recognition (HAR); Contrast enhancement; Bag of Visual Words (BoVW); Improved LGTP features; Convolutional neural network; Blue Monkey Standardized Aquila Optimization (BMSAO); NETWORK; FEATURES; SENSORS;
D O I
10.1007/s11042-023-17289-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human activity recognition (HAR) is an important research area in both machine learning and human-computer interactions. Unfortunately, it remains an extremely difficult process owing to unsolvable issues, such as sensor movement, sensor positioning, crowded background, and inherent diversity in task performances by distinct humans. In this study, we developed an ensemble of classification models for the HAR. The proposed HAR has four working phases-preprocessing, segmentation, feature extraction, and classification. The pre-processing phase includes processes such as frame conversion and contrast enhancement. We developed an improved balanced iterative reducing and clustering utilising hierarchies (BIRCH) algorithm, that provides efficient segmentation by utilizing only minimal resources. These segmented images are subjected to feature extraction, in which grey level co-occurrence matrix (GLCM) features, and improved local gradient threshold pattern (LGTP) features are extracted along with conventional bag of visual words (BoVW) to provide better results. An ensemble classification model with classifiers such as Bi-GRU, CNN, and LSTM was developed in this study to provide an accurate classification. To enhance the performance of the proposed model, we developed a blue monkey standardized aquila optimization (BMSAO) approach. Conventional techniques are contrasted with the proposed framework. The proposed mechanism was found to have higher efficiency in HAR after it was experimentally evaluated.
引用
收藏
页码:51669 / 51698
页数:30
相关论文
共 50 条
  • [1] Aquila Optimizer: A novel meta-heuristic optimization algorithm
    Abualigah, Laith
    Yousri, Dalia
    Abd Elaziz, Mohamed
    Ewees, Ahmed A.
    Al-qaness, Mohammed A. A.
    Gandomi, Amir H.
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 157 (157)
  • [2] Bag-of-words with aggregated temporal pair-wise word co-occurrence for human action recognition
    Agusti, Pau
    Javier Traver, V.
    Pla, Filiberto
    [J]. PATTERN RECOGNITION LETTERS, 2014, 49 : 224 - 230
  • [3] Ahmed Sohail, 2016, 2016 13th International Conference on Service Systems and Service Management (ICSSSM), P1, DOI 10.1109/ICSSSM.2016.7538459
  • [4] Facial Expression Recognition Using Local Transitional Pattern on Gabor Filtered Facial Images
    Ahsan, Tanveer
    Jabid, Taskeed
    Chong, Ui-Pil
    [J]. IETE TECHNICAL REVIEW, 2013, 30 (01) : 47 - 52
  • [5] A new technique for combining multiple classifiers using the Dempster-Shafer theory of evidence
    Al-Ani, M
    Deriche, M
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2002, 17 : 333 - 361
  • [6] Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models
    AlDahoul, Nouar
    Sabri, Aznul Qalid Md
    Mansoor, Ali Mohammed
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
  • [7] CHARM-Deep: Continuous Human Activity Recognition Model Based on Deep Neural Network Using IMU Sensors of Smartwatch
    Ashry, Sara
    Ogawa, Tetsuji
    Gomaa, Walid
    [J]. IEEE SENSORS JOURNAL, 2020, 20 (15) : 8757 - 8770
  • [8] Human action recognition with bag of visual words using different machine learning methods and hyperparameter optimization
    Aslan, Muhammet Fatih
    Durdu, Akif
    Sabanci, Kadir
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (12) : 8585 - 8597
  • [9] Local texton XOR patterns: A new feature descriptor for content-based image retrieval
    Bala, Anu
    Kaur, Tajinder
    [J]. ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2016, 19 (01): : 101 - 112
  • [10] Vision-based human activity recognition: a survey
    Beddiar, Djamila Romaissa
    Nini, Brahim
    Sabokrou, Mohammad
    Hadid, Abdenour
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (41-42) : 30509 - 30555