Electromyography-Based Gesture Recognition With Explainable AI (XAI): Hierarchical Feature Extraction for Enhanced Spatial-Temporal Dynamics

被引:0
作者
Shin, Jungpil [1 ]
Miah, Abu Saleh Musa [1 ]
Konnai, Sota [1 ]
Hoshitaka, Shu [1 ]
Kim, Pankoo [2 ]
机构
[1] Univ Aizu, Sch Comp Sci & Engn, Aizu Wakamatsu 9658580, Japan
[2] Chosun Univ, Dept Comp Engn, Gwangju 61452, South Korea
关键词
Feature extraction; Hands; Gesture recognition; Accuracy; Convolutional neural networks; Electromyography; Robustness; Muscles; Support vector machines; Deep learning; Hand gesture recognition; electromyography (EMG); deep learning; temporal convolutional network (TCN);
D O I
10.1109/ACCESS.2025.3569899
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hand gesture recognition using multichannel surface electromyography (sEMG) is challenging due to unstable predictions and inefficient time-varying feature enhancement. To overcome the lack of signal-based time-varying feature problems, we propose a lightweight squeeze-excitation deep learning-based multi-stream spatial-temporal dynamics time-varying feature extraction approach to build an effective sEMG-based hand gesture recognition system. Each branch of the proposed model was designed to extract hierarchical features, capturing both global and detailed spatial-temporal relationships to ensure feature effectiveness. The first branch, utilizing a Bidirectional-TCN (Bi-TCN), focuses on capturing long-term temporal dependencies by modelling past and future temporal contexts, providing a holistic view of gesture dynamics. The second branch, incorporating a 1D Convolutional layer, separable CNN, and Squeeze-and-Excitation (SE) block, efficiently extracts spatial-temporal features while emphasizing critical feature channels, enhancing feature relevance. The third branch, combining a Temporal Convolutional Network (TCN) and Bidirectional LSTM (BiLSTM), captures bidirectional temporal relationships and time-varying patterns. Outputs from all branches are fused using concatenation to capture subtle variations in the data and then refined with a channel attention module, selectively focusing on the most informative features while improving computational efficiency. The proposed model was tested on the Ninapro DB2, DB4, and DB5 datasets, achieving accuracy rates of 95.31%, 92.40%, and 93.34%, respectively. Additionally, we visualize the attention maps across various classes. These results demonstrate the system's capability to handle complex sEMG dynamics, offering advancements in prosthetic limb control and human-machine interface technologies with significant implications for assistive technologies.
引用
收藏
页码:88930 / 88951
页数:22
相关论文
共 73 条
[1]   Deep Learning with Convolutional Neural Networks Applied to Electromyography Data: A Resource for the Classification of Movements for Prosthetic Hands [J].
Atzori, Manfredo ;
Cognolato, Matteo ;
Mueller, Henning .
FRONTIERS IN NEUROROBOTICS, 2016, 10
[2]   Electromyography data for non-invasive naturally-controlled robotic hand prostheses [J].
Atzori, Manfredo ;
Gijsberts, Arjan ;
Castellini, Claudio ;
Caputo, Barbara ;
Hager, Anne-Gabrielle Mittaz ;
Elsig, Simone ;
Giatsidis, Giorgio ;
Bassetto, Franco ;
Muller, Henning .
SCIENTIFIC DATA, 2014, 1
[3]   Characterization of a Benchmark Database for Myoelectric Movement Classification [J].
Atzori, Manfredo ;
Gijsberts, Arjan ;
Kuzborskij, Ilja ;
Elsig, Simone ;
Hager, Anne-Gabrielle Mittaz ;
Deriaz, Olivier ;
Castellini, Claudio ;
Mueller, Henning ;
Caputo, Barbara .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2015, 23 (01) :73-83
[4]  
Bai SJ, 2018, Arxiv, DOI [arXiv:1803.01271, DOI 10.48550/ARXIV.1803.01271]
[5]  
Beckerle P., 2017, P 1 WORKSH PER MACH, P1
[6]   Stable Responsive EMG Sequence Prediction and Adaptive Reinforcement With Temporal Convolutional Networks [J].
Betthauser, Joseph L. ;
Krall, John T. ;
Bannowsky, Shain G. ;
Levay, Gyorgy ;
Kaliki, Rahul R. ;
Fifer, Matthew S. ;
Thakor, Nitish, V .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2020, 67 (06) :1707-1717
[7]  
Betthauser JL, 2019, I IEEE EMBS C NEUR E, P1046, DOI [10.1109/ner.2019.8717169, 10.1109/NER.2019.8717169]
[8]  
Chaiyaroj A, 2019, IEEE INT C BIOINFORM, P1443, DOI [10.1109/bibm47256.2019.8983049, 10.1109/BIBM47256.2019.8983049]
[9]  
Chattopadhyay R., 2011, Proceedings 2011 IEEE International Instrumentation and Measurement Technology Conference (I2MTC 2011), DOI 10.1109/IMTC.2011.5944173
[10]   Two-dimensional discrete feature based spatial attention CapsNet For sEMG signal recognition [J].
Chen, Guoqi ;
Wang, Wanliang ;
Wang, Zheng ;
Liu, Honghai ;
Zang, Zelin ;
Li, Weikun .
APPLIED INTELLIGENCE, 2020, 50 (10) :3503-3520