Enhancing the Transformer Model with a Convolutional Feature Extractor Block and Vector-Based Relative Position Embedding for Human Activity Recognition

被引:0
作者
Guo, Xin [1 ]
Kim, Young [1 ]
Ning, Xueli [1 ]
Min, Se Dong [1 ,2 ]
机构
[1] Soonchunhyang Univ, Dept Software Convergence, Asan 31538, South Korea
[2] Soonchunhyang Univ, Med IT Engn, Asan 31538, South Korea
基金
新加坡国家研究基金会;
关键词
human activity recognition; inertial measurement units (IMUs); transformer model; relative position embedding; convolutional neural networks (CNNs); time series signal; ACCELEROMETER; NETWORKS;
D O I
10.3390/s25020301
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The Transformer model has received significant attention in Human Activity Recognition (HAR) due to its self-attention mechanism that captures long dependencies in time series. However, for Inertial Measurement Unit (IMU) sensor time-series signals, the Transformer model does not effectively utilize the a priori information of strong complex temporal correlations. Therefore, we proposed using multi-layer convolutional layers as a Convolutional Feature Extractor Block (CFEB). CFEB enables the Transformer model to leverage both local and global time series features for activity classification. Meanwhile, the absolute position embedding (APE) in existing Transformer models cannot accurately represent the distance relationship between individuals at different time points. To further explore positional correlations in temporal signals, this paper introduces the Vector-based Relative Position Embedding (vRPE), aiming to provide more relative temporal position information within sensor signals for the Transformer model. Combining these innovations, we conduct extensive experiments on three HAR benchmark datasets: KU-HAR, UniMiB SHAR, and USC-HAD. Experimental results demonstrate that our proposed enhancement scheme substantially elevates the performance of the Transformer model in HAR.
引用
收藏
页数:20
相关论文
共 62 条
  • [1] Abid Mahmudul Hasan, 2021, 2021 IEEE 6th International Conference on Computing, Communication and Automation (ICCCA), P933, DOI 10.1109/ICCCA52192.2021.9666294
  • [2] WiFi-based human activity recognition through wall using deep learning
    Abuhoureyah, Fahd Saad
    Wong, Yan Chiew
    Isira, Ahmad Sadhiqin Bin Mohd
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [3] WiFi-Based Driver's Activity Monitoring with Efficient Computation of Radio-Image Features
    Akhtar, Zain Ul Abiden
    Wang, Hongyu
    [J]. SENSORS, 2020, 20 (05)
  • [4] The Applications of Metaheuristics for Human Activity Recognition and Fall Detection Using Wearable Sensors: A Comprehensive Analysis
    Al-qaness, Mohammed A. A.
    Helmi, Ahmed M.
    Dahou, Abdelghani
    Abd Elaziz, Mohamed
    [J]. BIOSENSORS-BASEL, 2022, 12 (10):
  • [5] A Comparison of Unidirectional and Bidirectional LSTM Networks for Human Activity Recognition
    Alawneh, Luay
    Mohsen, Belal
    Al-Zinati, Mohammad
    Shatnawi, Ahmed
    Al-Ayyoub, Mahmoud
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2020,
  • [6] Bidirectional Gated Recurrent Units For Human Activity Recognition Using Accelerometer Data
    Alsarhan, Tamam
    Alawneh, Luay
    Al-Zinati, Mohammad
    Al-Ayyoub, Mahmoud
    [J]. 2019 IEEE SENSORS, 2019,
  • [7] Window Size Impact in Human Activity Recognition
    Banos, Oresti
    Galvez, Juan-Manuel
    Damas, Miguel
    Pomares, Hector
    Rojas, Ignacio
    [J]. SENSORS, 2014, 14 (04) : 6474 - 6499
  • [8] Human Activity Recognition Based on Dynamic Active Learning
    Bi, Haixia
    Perello-Nieto, Miquel
    Santos-Rodriguez, Raul
    Flach, Peter
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (04) : 922 - 934
  • [9] A Survey on Graph Neural Networks and Graph Transformers in Computer Vision: A Task-Oriented Perspective
    Chen, Chaoqi
    Wu, Yushuang
    Dai, Qiyuan
    Zhou, Hong-Yu
    Xu, Mutian
    Yang, Sibei
    Han, Xiaoguang
    Yu, Yizhou
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10297 - 10318
  • [10] Learning hierarchical time series data augmentation invariances via contrastive supervision for human activity recognition
    Cheng, Dongzhou
    Zhang, Lei
    Bu, Can
    Wu, Hao
    Song, Aiguo
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 276