Recognizing human activities with the use of Convolutional Block Attention Module

Cited by: 1
Authors
Zakariah, Mohammed [1]
Alnuaim, Abeer [1]
Affiliations
[1] King Saud Univ, Coll Appl Studies & Community Serv, Dept Comp Sci & Engn, POB 22459, Riyadh 11495, Saudi Arabia
Keywords
Human activity recognition; Human behaviour recognition; Deep-learning; Convolutional Block Attention Module (CBAM); Convolution Neural Network; Spatial Attention Module; Human action recognition
DOI
10.1016/j.eij.2024.100536
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Human Activity Recognition (HAR) is crucial for the advancement of applications in smart environments, communication, IoT, security, and healthcare monitoring. Convolutional neural networks (CNNs) have made substantial contributions to HAR. However, they frequently have difficulty accurately discerning intricate human actions in real-time situations. This study aims to fill a significant research gap by incorporating the Convolutional Block Attention Module (CBAM) into CNN architectures, with the goal of improving the extraction of features from video sequences. CBAM boosts the performance of the network by selectively prioritizing significant spatial and channel-wise information, resulting in improved detection of subtle activity patterns and more stable classification. CBAM's attention mechanism directly focuses on and amplifies essential features, which sets it apart from typical CNNs that lack a refined focus mechanism. This approach results in improved performance in behavior identification tests. The proposed CBAM-enhanced model has been extensively tested on benchmark datasets, yielding an accuracy of 94.23% on the HMDB51 dataset. It also achieved competitive results of 83.4% and 88.9% on the UCF-101 and UCF-50 datasets, respectively. However, it remains insufficiently understood how CBAM adapts to different CNN architectures and how suitable it is for varied HAR situations beyond controlled datasets. Future studies should investigate the integration of CBAM with other CNN frameworks, assess its efficacy in practical scenarios, and explore multi-modal sensor fusion techniques to enhance its reliability and utility. This study showcases the ability of CBAM to enhance HAR capabilities and paves the way for future research to improve activity identification systems for wider and more practical uses.
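The channel-wise and spatial prioritization described in the abstract can be illustrated with a minimal NumPy sketch of a CBAM-style block. This is not the authors' implementation: it assumes the commonly described CBAM layout (channel attention from average- and max-pooled descriptors passed through a shared two-layer MLP, followed by spatial attention from a convolution over channel-pooled maps), and the function names, tensor shapes, reduction ratio, kernel size, and random weights are all hypothetical choices made for illustration.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def channel_attention(x, w1, w2):
    """x: (C, H, W); w1: (C // r, C); w2: (C, C // r). Returns (C,) weights in (0, 1)."""
    avg = x.mean(axis=(1, 2))  # global average pooling per channel
    mx = x.max(axis=(1, 2))    # global max pooling per channel

    def mlp(v):
        # Shared two-layer MLP with a ReLU bottleneck.
        return w2 @ np.maximum(w1 @ v, 0.0)

    return sigmoid(mlp(avg) + mlp(mx))


def spatial_attention(x, kernel):
    """x: (C, H, W); kernel: (2, k, k) with odd k. Returns (H, W) weights in (0, 1)."""
    pooled = np.stack([x.mean(axis=0), x.max(axis=0)])  # (2, H, W)
    k = kernel.shape[-1]
    pad = k // 2
    padded = np.pad(pooled, ((0, 0), (pad, pad), (pad, pad)))
    # windows has shape (2, H, W, k, k): one k x k patch per output position.
    windows = np.lib.stride_tricks.sliding_window_view(padded, (k, k), axis=(1, 2))
    return sigmoid(np.einsum("chwij,cij->hw", windows, kernel))


def cbam(x, w1, w2, kernel):
    """Apply channel attention, then spatial attention, to a (C, H, W) feature map."""
    x = x * channel_attention(x, w1, w2)[:, None, None]
    return x * spatial_attention(x, kernel)[None, :, :]


# Hypothetical feature map and randomly initialised (untrained) weights.
rng = np.random.default_rng(0)
features = rng.standard_normal((8, 6, 6))    # C=8, H=W=6
w1 = 0.1 * rng.standard_normal((2, 8))       # reduction ratio r=4 (assumed)
w2 = 0.1 * rng.standard_normal((8, 2))
kernel = 0.1 * rng.standard_normal((2, 7, 7))  # 7x7 spatial kernel (assumed)

refined = cbam(features, w1, w2, kernel)
print(refined.shape)
```

Because both attention maps lie in (0, 1), the block only rescales the input feature map: it preserves the shape and never increases the magnitude of any activation. In a trained network the weights would be learned jointly with the CNN backbone, and the block would typically be applied per frame or per clip feature map.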
Pages: 24