A Lightweight Multi-Scale Convolutional Neural Network for P300 Decoding: Analysis of Training Strategies and Uncovering of Network Decision

被引:21
作者
Borra, Davide [1 ]
Fantozzi, Silvia [1 ,2 ]
Magosso, Elisa [1 ,2 ,3 ]
机构
[1] Univ Bologna, Dept Elect Elect & Informat Engn Guglielmo Marcon, Cesena, Italy
[2] Univ Bologna, Interdept Ctr Ind Res Hlth Sci & Technol, Bologna, Italy
[3] Univ Bologna, Alma Mater Res Inst, Ctr Artificial Intelligence, Bologna, Italy
关键词
electroencephalography; P300; convolutional neural networks; transfer learning; decision explanation; brain-computer interfaces; AUTISM;
D O I
10.3389/fnhum.2021.655840
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Convolutional neural networks (CNNs), which automatically learn features from raw data to approximate functions, are being increasingly applied to the end-to-end analysis of electroencephalographic (EEG) signals, especially for decoding brain states in brain-computer interfaces (BCIs). Nevertheless, CNNs introduce a large number of trainable parameters, may require long training times, and lack in interpretability of learned features. The aim of this study is to propose a CNN design for P300 decoding with emphasis on its lightweight design while guaranteeing high performance, on the effects of different training strategies, and on the use of post-hoc techniques to explain network decisions. The proposed design, named MS-EEGNet, learned temporal features in two different timescales (i.e., multi-scale, MS) in an efficient and optimized (in terms of trainable parameters) way, and was validated on three P300 datasets. The CNN was trained using different strategies (within-participant and within-session, within-participant and cross-session, leave-one-subject-out, transfer learning) and was compared with several state-of-the-art (SOA) algorithms. Furthermore, variants of the baseline MS-EEGNet were analyzed to evaluate the impact of different hyper-parameters on performance. Lastly, saliency maps were used to derive representations of the relevant spatio-temporal features that drove CNN decisions. MS-EEGNet was the lightest CNN compared with the tested SOA CNNs, despite its multiple timescales, and significantly outperformed the SOA algorithms. Post-hoc hyper-parameter analysis confirmed the benefits of the innovative aspects of MS-EEGNet. Furthermore, MS-EEGNet did benefit from transfer learning, especially using a low number of training examples, suggesting that the proposed approach could be used in BCIs to accurately decode the P300 event while reducing calibration times. Representations derived from the saliency maps matched the P300 spatio-temporal distribution, further validating the proposed decoding approach. This study, by specifically addressing the aspects of lightweight design, transfer learning, and interpretability, can contribute to advance the development of deep learning algorithms for P300-based BCIs.
引用
收藏
页数:22
相关论文
共 50 条
[1]   A Feasibility Clinical Trial to Improve Social Attention in Autistic Spectrum Disorder (ASD) Using a Brain Computer Interface [J].
Amaral, Carlos ;
Mouga, Susana ;
Simoes, Marco ;
Pereira, Helena C. ;
Bernardino, Ines ;
Quental, Hugo ;
Playle, Rebecca ;
McNamara, Rachel ;
Oliveira, Guiomar ;
Castelo-Branco, Miguel .
FRONTIERS IN NEUROSCIENCE, 2018, 12
[2]   A novel Brain Computer Interface for classification of social joint attention in autism and comparison of 3 experimental setups: A feasibility study [J].
Amaral, Carlos P. ;
Simoes, Marco A. ;
Mouga, Susana ;
Andrade, Joao ;
Castelo-Branco, Miguel .
JOURNAL OF NEUROSCIENCE METHODS, 2017, 290 :105-115
[3]  
Barachant A., 2014, ARXIV PREPRINT ARXIV
[4]   Multiclass Brain-Computer Interface Classification by Riemannian Geometry [J].
Barachant, Alexandre ;
Bonnet, Stephane ;
Congedo, Marco ;
Jutten, Christian .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2012, 59 (04) :920-928
[5]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[6]   The BCI competition 2003:: Progress and perspectives in detection and discrimination of EEG single trials [J].
Blankertz, B ;
Müller, KR ;
Curio, G ;
Vaughan, TM ;
Schalk, G ;
Wolpaw, JR ;
Schlögl, A ;
Neuper, C ;
Pfurtscheller, G ;
Hinterberger, T ;
Schröder, M ;
Birbaumer, N .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2004, 51 (06) :1044-1051
[7]   The BCI competition III:: Validating alternative approaches to actual BCI problems [J].
Blankertz, Benjamin ;
Mueller, Klaus-Robert ;
Krusienski, Dean J. ;
Schalk, Gerwin ;
Wolpaw, Jonathan R. ;
Schloegl, Alois ;
Pfurtscheller, Gert ;
Millan, Jose D. R. ;
Schroeder, Michael ;
Birbaumer, Niels .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2006, 14 (02) :153-159
[8]   EEG Motor Execution Decoding via Interpretable Sinc-Convolutional Neural Networks [J].
Borra, Davide ;
Fantozzi, Silvia ;
Magosso, Elisa .
XV MEDITERRANEAN CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING AND COMPUTING - MEDICON 2019, 2020, 76 :1113-1122
[9]   Convolutional Neural Network for a P300 Brain-Computer Interface to Improve Social Attention in Autistic Spectrum Disorder [J].
Borra, Davide ;
Fantozzi, Silvia ;
Magosso, Elisa .
XV MEDITERRANEAN CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING AND COMPUTING - MEDICON 2019, 2020, 76 :1837-1843
[10]   Interpretable and lightweight convolutional neural network for EEG decoding: Application to movement execution and imagination [J].
Borra, Davide ;
Fantozzi, Silvia ;
Magosso, Elisa .
NEURAL NETWORKS, 2020, 129 :55-74