Mutual Information-Driven Subject-Invariant and Class-Relevant Deep Representation Learning in BCI

被引:56
作者
Jeon, Eunjin [1 ]
Ko, Wonjun [1 ]
Yoon, Jee Seok [1 ]
Suk, Heung-Il [1 ,2 ]
机构
[1] Korea Univ, Dept Brain & Cognit Engn, Seoul 02841, South Korea
[2] Korea Univ, Dept Artificial Intelligence, Seoul 02841, South Korea
关键词
Electroencephalography; Feature extraction; Training; Mutual information; Brain modeling; Transfer learning; Decoding; Brain-computer interface (BCI); deep learning; domain adaptation; electroencephalogram; motor imagery; mutual information; subject-independent; transfer learning; NEURAL-NETWORKS; EEG; FRAMEWORK; PATTERNS;
D O I
10.1109/TNNLS.2021.3100583
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep learning-based feature representation methods have shown a promising impact on electroencephalography (EEG)-based brain-computer interface (BCI). Nonetheless, owing to high intra- and inter-subject variabilities, many studies on decoding EEG were designed in a subject-specific manner by using calibration samples, with no concern of its practical use, hampered by time-consuming steps and a large data requirement. To this end, recent studies adopted a transfer learning strategy, especially domain adaptation techniques. Among those, we have witnessed the potential of adversarial learning-based transfer learning in BCIs. In the meantime, it is known that adversarial learning-based domain adaptation methods are prone to negative transfer that disrupts learning generalized feature representations, applicable to diverse domains, for example, subjects or sessions in BCIs. In this article, we propose a novel framework that learns class-relevant and subject-invariant feature representations in an information-theoretic manner, without using adversarial learning. To be specific, we devise two operational components in a deep network that explicitly estimate mutual information between feature representations: 1) to decompose features in an intermediate layer into class-relevant and class-irrelevant ones and 2) to enrich class-discriminative feature representation. On two large EEG datasets, we validated the effectiveness of our proposed framework by comparing with several comparative methods in performance. Furthermore, we conducted rigorous analyses by performing an ablation study in regard to the components in our network, explaining our model's decision on input EEG signals via layer-wise relevance propagation, and visualizing the distribution of learned features via t-SNE.
引用
收藏
页码:739 / 749
页数:11
相关论文
共 45 条
[1]   Separable Common Spatio-Spectral Patterns for Motor Imagery BCI Systems [J].
Aghaei, Amirhossein S. ;
Mahanta, Mohammad Shahin ;
Plataniotis, Konstantinos N. .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2016, 63 (01) :15-29
[2]  
Ang KK, 2008, IEEE IJCNN, P2390, DOI 10.1109/IJCNN.2008.4634130
[3]  
[Anonymous], 2019, I IEEE EMBS C NEUR E
[4]   Weighted Transfer Learning for Improving Motor Imagery-Based Brain-Computer Interface [J].
Azab, Ahmed M. ;
Mihaylova, Lyudmila ;
Ang, Kai Keng ;
Arvaneh, Mahnaz .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2019, 27 (07) :1352-1359
[5]   On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation [J].
Bach, Sebastian ;
Binder, Alexander ;
Montavon, Gregoire ;
Klauschen, Frederick ;
Mueller, Klaus-Robert ;
Samek, Wojciech .
PLOS ONE, 2015, 10 (07)
[6]  
Belghazi MI, 2018, PR MACH LEARN RES, V80
[7]   Unsupervised domain adaptation techniques based on auto-encoder for non-stationary EEG-based emotion recognition [J].
Chai, Xin ;
Wang, Qisong ;
Zhao, Yongping ;
Liu, Xin ;
Bai, Ou ;
Li, Yongqiang .
COMPUTERS IN BIOLOGY AND MEDICINE, 2016, 79 :205-214
[8]   EEG datasets for motor imagery brain-computer interface [J].
Cho, Hohyun ;
Ahn, Minkyu ;
Ahn, Sangtae ;
Kwon, Moonyoung ;
Jun, Sung Chan .
GIGASCIENCE, 2017, 6 (07) :1-8
[9]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[10]   ASYMPTOTIC EVALUATION OF CERTAIN MARKOV PROCESS EXPECTATIONS FOR LARGE TIME .4. [J].
DONSKER, MD ;
VARADHAN, SRS .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 1983, 36 (02) :183-212