Convolutional Neural Network With Multihead Attention for Human Activity Recognition

被引:3
|
作者
Tan, Tan-Hsu [1 ]
Chang, Yang-Lang [1 ]
Wu, Jun-Rong [1 ]
Chen, Yung-Fu [2 ,3 ]
Alkhaleefah, Mohammad [1 ]
机构
[1] Natl Taipei Univ Technol, Dept Elect Engn, Taipei 10608, Taiwan
[2] Cent Taiwan Univ Sci & Technol, Dept Dent Technol & Mat Sci, Taichung 40601, Taiwan
[3] China Med Univ, Dept Hlth Serv Adm, Taichung 404, Taiwan
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 02期
关键词
Convolutional neural network (CNN); deep learning; human activity recognition (HAR); Internet of Things (IoT); multihead attention (MHA) mechanism;
D O I
10.1109/JIOT.2023.3294421
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional neural networks (CNNs) have shown great promise in human activity recognition (HAR), but long-term dependencies in time series data can be difficult to capture using standard CNNs. This study introduces a new CNN architecture that incorporates a multihead attention mechanism (CNN-MHA) to address this challenge. This mechanism is composed of several attention heads, each independently calculating attention weights for distinct segments of the input. The attention head outputs are then concatenated and processed through a fully connected layer to produce the final attention representation. A multihead attention (MHA) mechanism allows the network to focus on relevant features and maintain long-term dependencies in the input data. The proposed model is evaluated on the physical activity monitoring for aging people data set (PAMAP2) from the UCI machine learning repository, which is preprocessed by cleaning, normalization, segmentation, and reshaping before splitting into training, validation, and testing sets. The experimental results demonstrate that the CNN-MHA model outperforms existing models, achieving F1-score of 95.7%. Particularly, the MHA mechanism significantly improves the model's ability to recognize complex activity patterns. Furthermore, our model attained an average inference latency of 0.304 s, which can be crucial in real-time applications. The findings clearly demonstrate the substantial promise of the proposed CNN-MHA architecture for optimizing HAR tasks, offering a powerful tool for advancing the state-of-the-art in this domain.
引用
收藏
页码:3032 / 3043
页数:12
相关论文
共 50 条
  • [11] A Convolutional Neural Network for Smoking Activity Recognition
    Alharbi, Fayez
    Farrahi, Katayoun
    2018 IEEE 20TH INTERNATIONAL CONFERENCE ON E-HEALTH NETWORKING, APPLICATIONS AND SERVICES (HEALTHCOM), 2018,
  • [12] A time-efficient convolutional neural network model in human activity recognition
    Marjan Gholamrezaii
    SMT AlModarresi
    Multimedia Tools and Applications, 2021, 80 : 19361 - 19376
  • [13] Human Activity Recognition with Convolutional Neural Networks
    Bevilacqua, Antonio
    MacDonald, Kyle
    Rangarej, Aamina
    Widjaya, Venessa
    Caulfield, Brian
    Kechadi, Tahar
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT III, 2019, 11053 : 541 - 552
  • [14] Human Activity Recognition Based on Multichannel Convolutional Neural Network With Data Augmentation
    Shi, Wenbing
    Fang, Xianjin
    Yang, Gaoming
    Huang, Ji
    IEEE ACCESS, 2022, 10 : 76596 - 76606
  • [15] A time-efficient convolutional neural network model in human activity recognition
    Gholamrezaii, Marjan
    AlModarresi, S. M. T.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (13) : 19361 - 19376
  • [16] Optimal Deep Convolutional Neural Network with Pose Estimation for Human Activity Recognition
    Nandagopal, S.
    Karthy, G.
    Oliver, A. Sheryl
    Subha, M.
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (02): : 1719 - 1733
  • [17] Convolutional Neural Network With Attention Mechanism for SAR Automatic Target Recognition
    Zhang, Ming
    An, Jubai
    Yu, Da Hua
    Yang, Li Dong
    Wu, Liang
    Lu, Xiao Qi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [18] Human activity recognition and fall detection using convolutional neural network and transformer-based architecture
    Al-qaness, Mohammed A. A.
    Dahou, Abdelghani
    Abd Elaziz, Mohamed
    Helmi, Ahmed M.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 95
  • [19] CJAM: Convolutional Neural Network Joint Attention Mechanism in Gait Recognition
    Jia, Pengtao
    Zhao, Qi
    Li, Boze
    Zhang, Jing
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (08) : 1239 - 1249
  • [20] Micro-Expression Recognition Using Convolutional Variational Attention Transformer (ConVAT) With Multihead Attention Mechanism
    Bin Talib, Hafiz Khizer
    Xu, Kaiwei
    Cao, Yanlong
    Xu, Yuan Ping
    Xu, Zhijie
    Zaman, Muhammad
    Akhunzada, Adnan
    IEEE ACCESS, 2025, 13 : 20054 - 20070