A CNN-Transformer Hybrid Recognition Approach for sEMG-Based Dynamic Gesture Prediction

被引:35
作者
Liu, Yanhong [1 ]
Li, Xingyu [1 ]
Yang, Lei [1 ]
Bian, Guibin [2 ,3 ]
Yu, Hongnian [1 ,4 ]
机构
[1] Zhengzhou Univ, Sch Elect & Informat Engn, Zhengzhou 450001, Henan, Peoples R China
[2] Zhengzhou Univ, Sch Elect Engn, Zhengzhou 450001, Henan, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[4] Edinburgh Napier Univ, Sch Engn & Built Environm, Edinburgh EH10 5DT, Scotland
基金
中国国家自然科学基金;
关键词
Feature extraction; Gesture recognition; Task analysis; Transformers; Time-frequency analysis; Convolution; Data mining; Convolutional neural network (CNN); feature fusion; hand gesture recognition; surface electromyography (sEMG) sensor; transformer; HAND; SIGNALS; ROBUST;
D O I
10.1109/TIM.2023.3273651
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As a unique physiological electrical signal in the human body, surface electromyography (sEMG) signals always include human movement intention and muscle state. Through the collection of sEMG signals, different gestures can be effectively recognized. At present, the convolutional neural network (CNN) has been widely applied to different gesture recognition systems. However, due to its inherent limitations in global context feature extraction, it exists a certain shortcoming on high-precision prediction tasks. To solve this issue, a CNN-transformer hybrid recognition approach is proposed for high-precision dynamic gesture prediction. In addition, the continuous wavelet transform (CWT) is proposed for to acquire the time-frequency maps. To realize effective feature representation of local features from the time-frequency maps, an attention fusion block (AFB) is proposed to build the deep CNN network branch to effectively extract key channel information and spatial information from local features. Faced with the inherent limitations in global context feature extraction of CNNs, a transformer network branch is proposed to model the global relationship between pixels, called convolution and transformer (CAT) network branch. In addition, a multiscale feature attention (MFA) block is proposed for effective feature aggregation of local features and global contexts by learning adaptive multiscale features and suppressing irrelevant scale information. The experimental results on the established multichannel sEMG signal time-frequency map dataset show that the proposed CNN transformer hybrid recognition network has competitive recognition performance compared with other state-of-the-art recognition networks, and the average recognition speed of each spectrogram on the test set is only 14.7 ms. The proposed network can effectively improve network performance and identification efficiency.
引用
收藏
页数:16
相关论文
共 51 条
[1]   Improving the Performance Against Force Variation of EMG Controlled Multifunctional Upper-Limb Prostheses for Transradial Amputees [J].
Al-Timemy, Ali H. ;
Khushaba, Rami N. ;
Bugmann, Guido ;
Escudero, Javier .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2016, 24 (06) :650-661
[2]   An EMG-Based Robot Control Scheme Robust to Time-Varying EMG Signal Features [J].
Artemiadis, Panagiotis K. ;
Kyriakopoulos, Kostas J. .
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2010, 14 (03) :582-588
[3]   Toward the Minimum Number of Wearables to Recognize Signer-Independent Italian Sign Language With Machine-Learning Algorithms [J].
Calado, Alexandre ;
Errico, Vito ;
Saggio, Giovanni .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70 (70)
[4]   Hand Gesture Recognition based on Surface Electromyography using Convolutional Neural Network with Transfer Learning Method [J].
Chen, Xiang ;
Li, Yu ;
Hu, Ruochen ;
Zhang, Xu ;
Chen, Xun .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (04) :1292-1304
[5]   Deep Learning for Electromyographic Hand Gesture Signal Classification Using Transfer Learning [J].
Cote-Allard, Ulysse ;
Fall, Cheikh Latyr ;
Drouin, Alexandre ;
Campeau-Lecours, Alexandre ;
Gosselin, Clement ;
Glette, Kyrre ;
Laviolette, Francois ;
Gosselin, Benoit .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2019, 27 (04) :760-771
[6]   Unsupervised Trajectory Segmentation for Surgical Gesture Recognition in Robotic Training [J].
Despinoy, Fabien ;
Bouget, David ;
Forestier, Germain ;
Penet, Cedric ;
Zemiti, Nabil ;
Poignet, Philippe ;
Jannin, Pierre .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2016, 63 (06) :1280-1291
[7]   An Instrumented Glove for Recognition of Brazilian Sign Language Alphabet [J].
Dias, Thiago Simoes ;
Alves Mendes Junior, Jose Jair ;
Pichorim, Sergio Francisco .
IEEE SENSORS JOURNAL, 2022, 22 (03) :2518-2529
[8]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[9]   A Novel Emotion-Aware Method Based on the Fusion of Textual Description of Speech, Body Movements, and Facial Expressions [J].
Du, Guanglong ;
Zeng, Yuwen ;
Su, Kang ;
Li, Chunquan ;
Wang, Xueqian ;
Teng, Shaohua ;
Li, Di ;
Liu, Peter Xiaoping .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[10]   A Gesture Recognition System Based on Time Domain Features and Linear Discriminant Analysis [J].
Duan, Feng ;
Ren, Xina ;
Yang, Yikang .
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2021, 13 (01) :200-208