Deep Correlation Analysis for Audio-EEG Decoding

被引:16
作者
Katthi, Jaswanth Reddy [1 ]
Ganapathy, Sriram [1 ]
机构
[1] Indian Inst Sci, Elect Engn Dept, Learning & Extract Acoust Patterns LEAP Lab, Bengaluru 560012, India
关键词
Electroencephalography; Brain modeling; Correlation; Transforms; Analytical models; Task analysis; Deep learning; Canonical correlation analysis (CCA); multiway CCA (MCCA); deep learning; audio-EEG analysis; NEURAL-NETWORKS; REPRESENTATION; SPEECH; BRAIN;
D O I
10.1109/TNSRE.2021.3129790
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
The electroencephalography (EEG), which is one of the easiest modes of recording brain activations in a non-invasive manner, is often distorted due to recording artifacts which adversely impacts the stimulus-response analysis. The most prominent techniques thus far attempt to improve the stimulus-response correlations using linear methods. In this paper, we propose a neural network based correlation analysis framework that significantly improves over the linear methods for auditory stimuli. A deep model is proposed for intra-subject audio-EEG analysis based on directly optimizing the correlation loss. Further, a neural network model with a shared encoder architecture is proposed for improving the inter-subject stimulus response correlations. These models attempt to suppress the EEG artifacts while preserving the components related to the stimulus. Several experiments are performed using EEG recordings from subjects listening to speech and music stimuli. In these experiments, we show that the deep models improve the Pearson correlation significantly over the linear methods (average absolute improvements of 7.4% in speech tasks and 29.3% in music tasks). We also analyze the impact of several model parameters on the stimulus-response correlation.
引用
收藏
页码:2742 / 2753
页数:12
相关论文
共 50 条
[1]   Adjusting for multiple testing when reporting research results: The Bonferroni vs Holm methods [J].
Aickin, M ;
Gensler, H .
AMERICAN JOURNAL OF PUBLIC HEALTH, 1996, 86 (05) :726-728
[2]   A Tutorial on Auditory Attention Identification Methods [J].
Alickovic, Emina ;
Lunner, Thomas ;
Gustafsson, Fredrik ;
Ljung, Lennart .
FRONTIERS IN NEUROSCIENCE, 2019, 13
[3]   Large-scale brain networks emerge from dynamic processing of musical timbre, key and rhythm [J].
Alluri, Vinoo ;
Toiviainen, Petri ;
Jaaskelainen, Iiro P. ;
Glerean, Enrico ;
Sams, Mikko ;
Brattico, Elvira .
NEUROIMAGE, 2012, 59 (04) :3677-3689
[4]  
Andrienko G., 2013, Introduction, P1
[5]  
[Anonymous], 2007, EEG signal processing, DOI DOI 10.1002/9780470511923
[6]  
[Anonymous], 2012, ARXIV PREPRINT ARXIV
[7]   Electrophysiological Correlates of Semantic Dissimilarity Reflect the Comprehension of Natural, Narrative Speech [J].
Broderick, Michael P. ;
Anderson, Andrew J. ;
Di Liberto, Giovanni M. ;
Crosse, Michael J. ;
Lalor, Edmund C. .
CURRENT BIOLOGY, 2018, 28 (05) :803-+
[8]   Convolutional Neural Networks for P300 Detection with Application to Brain-Computer Interfaces [J].
Cecotti, Hubert ;
Graeser, Axel .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) :433-445
[9]  
Cohen J., 1988, STAT POWER ANAL BEHA, DOI 10.4324/9780203771587
[10]   Multi-set canonical correlation analysis for the fusion of concurrent single trial ERP and functional MRI [J].
Correa, Nicolle M. ;
Eichele, Tom ;
Adali, Tulay ;
Li, Yi-Ou ;
Calhoun, Vince D. .
NEUROIMAGE, 2010, 50 (04) :1438-1445