A transfer learning-based CNN and LSTM hybrid deep learning model to classify motor imagery EEG signals

Cited by: 100
Authors
Khademi, Zahra [1]
Ebrahimi, Farideh [1]
Kordy, Hussain Montazery [1]
Affiliations
[1] Babol Noshirvani Univ Technol, Fac Elect & Comp Engn, Shariati Ave, Babol, Iran
Keywords
BCI; MI; Hybrid neural network; Convolutional neural network; LSTM; Transfer learning
DOI
10.1016/j.compbiomed.2022.105288
Chinese Library Classification
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
In Motor Imagery (MI)-based Brain-Computer Interfaces (BCIs), the user's intention is converted into a control signal by processing a specific pattern in brain signals that reflects motor characteristics. The classification of MI Electroencephalogram (EEG) signals is restricted by the limited size of existing datasets and a low signal-to-noise ratio. Machine learning (ML) methods, particularly Deep Learning (DL), have overcome these limitations to some extent. In this study, three hybrid models were proposed to classify EEG signals in MI-based BCI. The proposed hybrid models consist of a convolutional neural network (CNN) and a Long Short-Term Memory (LSTM) network. In the first model, CNNs with different numbers of convolutional-pooling blocks (from shallow to deep) were examined, and a two-block CNN that was not affected by the vanishing gradient problem yet was able to extract desirable features was employed. The second and third models contained pre-trained CNNs conducive to the exploration of more complex features. The transfer learning strategy and data augmentation methods were applied to overcome the limited size of the datasets by transferring learning from one model to another. This was achieved by employing two powerful pre-trained convolutional neural networks, namely ResNet-50 and Inception-v3. The continuous wavelet transform (CWT) was used to generate images for the CNN. The performance of the proposed models was evaluated on the BCI Competition IV dataset 2a. Mean accuracy values of 86%, 90%, and 92%, and mean Kappa values of 81%, 86%, and 88% were obtained for the hybrid neural network with the customized CNN, the hybrid neural network with ResNet-50, and the hybrid neural network with Inception-v3, respectively. Although all three proposed models performed promisingly, the hybrid neural network with Inception-v3 outperformed the other two.
The best result obtained in the present study improved on the previous best result in the literature by 7% in terms of classification accuracy. From the findings, it can be concluded that transfer learning based on a pre-trained CNN in combination with LSTM is a novel method in MI-based BCI. The study also has implications for the discrimination of motor imagery tasks in each EEG recording channel and in different brain regions, which can reduce computational time in future work by selecting only the most effective channels.
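The abstract states that the CWT was used to generate images for the CNN but gives no parameters. The following is a minimal illustrative sketch of that step, not the authors' implementation. The complex Morlet wavelet, the 4-40 Hz frequency grid, the `n_cycles` value, the 250 Hz sampling rate, and the helper names `morlet_scalogram` and `to_uint8_image` are all assumptions made for this example.

```python
import numpy as np


def morlet_scalogram(signal, fs, freqs, n_cycles=6.0):
    """Magnitude of a complex-Morlet CWT, shape (len(freqs), len(signal)).

    Illustrative only: the wavelet family and n_cycles are assumptions.
    """
    signal = np.asarray(signal, dtype=float)
    scalogram = np.empty((len(freqs), signal.size))
    for i, f in enumerate(freqs):
        sigma_t = n_cycles / (2.0 * np.pi * f)      # std of the Gaussian envelope (s)
        half = int(np.ceil(4.0 * sigma_t * fs))     # truncate the wavelet at +/- 4 sigma
        if 2 * half + 1 > signal.size:
            raise ValueError("signal is too short for the lowest frequency")
        t = np.arange(-half, half + 1) / fs
        envelope = np.exp(-t**2 / (2.0 * sigma_t**2))
        # Envelope sums to 1, so a unit-amplitude sinusoid at f gives ~0.5
        wavelet = np.exp(2j * np.pi * f * t) * envelope / envelope.sum()
        scalogram[i] = np.abs(np.convolve(signal, wavelet, mode="same"))
    return scalogram


def to_uint8_image(scalogram):
    """Min-max scale a scalogram to an 8-bit grayscale image."""
    lo, hi = scalogram.min(), scalogram.max()
    scaled = (scalogram - lo) / (hi - lo) if hi > lo else np.zeros_like(scalogram)
    return np.round(255.0 * scaled).astype(np.uint8)


# Synthetic single-channel "trial": a 10 Hz rhythm, 4 s at an assumed 250 Hz
fs = 250.0
t = np.arange(1000) / fs
x = np.sin(2.0 * np.pi * 10.0 * t)
freqs = np.arange(4.0, 41.0)          # assumed 4-40 Hz grid, 1 Hz steps
scal = morlet_scalogram(x, fs, freqs)
image = to_uint8_image(scal)          # 2-D array that could be resized and fed to a CNN
```

In a pipeline like the one the abstract describes, one such image per channel or trial would be resized to the input size of the chosen CNN (for example a pre-trained ResNet-50 or Inception-v3) before feature extraction; that resizing and the subsequent LSTM stage are not shown here.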
Pages: 14