Enhancing Human Activity Detection and Classification Using Fine Tuned Attention-Based Transformer Models

Authors
Ram Kumar Yadav [1]
A. Daniel [1]
Vijay Bhaskar Semwal [2]
Affiliations
[1] Amity University, Department of Computer Science and Engineering
[2] MANIT, Department of Computer Science and Engineering
Keywords
Machine learning; Deep learning; HAR; Data augmentation; Attention-based transformers
DOI
10.1007/s42979-024-03445-5
Abstract
Human activity recognition (HAR) is an active research area. It draws on the Internet of Things, sensing methods, machine learning, and deep learning to support applications such as home monitoring, robotics, surveillance, and healthcare. However, researchers still face problems with time complexity, long model execution times, and classification accuracy. This paper introduces a novel approach to these problems based on attention-based deep learning transformer models: ViT (Vision Transformer), DeiT (Data-efficient image Transformers), and Swin Transformer V2 (SwinV2) are applied to the image-based datasets Stanford40 and MPII Human Pose, and the VideoMAE transformer is applied to the video-based datasets UCF101 and HMDB51. With ViT, DeiT, and SwinV2, the reported accuracies are 90.8%, 90.7%, and 88% on Stanford40, and 87%, 85.6%, and 87.1% on MPII Human Pose. For video-based activity recognition, VideoMAE reaches accuracies of 94.15% on UCF101 and 78.44% on HMDB51. These findings emphasize the efficacy of attention-based transformer models (ViT, DeiT, SwinV2, and VideoMAE); the authors also present as novel the evaluation of these models on datasets for which, they state, no such results had been reported earlier.