Decoding the attended speech stream with multi-channel EEG: implications for online, daily-life applications

Cited by: 169
Authors
Mirkovic, Bojana [1 ,3 ]
Debener, Stefan [1 ,3 ,4 ]
Jaeger, Manuela [1 ]
De Vos, Maarten [2 ]
Affiliations
[1] Carl von Ossietzky Univ Oldenburg, Dept Psychol, Neuropsychol Lab, D-26129 Oldenburg, Germany
[2] Univ Oxford, Dept Engn, Inst Biomed Engn, Oxford OX3 7DQ, England
[3] Carl von Ossietzky Univ Oldenburg, Cluster Excellence Hearing4all, D-26129 Oldenburg, Germany
[4] Carl von Ossietzky Univ Oldenburg, Res Ctr Neurosensory Sci, D-26129 Oldenburg, Germany
Keywords
cocktail party; selective attention; speech envelope; stimulus reconstruction; mobile EEG; BCI; brain-computer interface; auditory objects; responses; cues
DOI
10.1088/1741-2560/12/4/046007
Chinese Library Classification
R318 [Biomedical Engineering];
Discipline Classification Code
0831;
Abstract
Objective. Recent studies have provided evidence that temporal-envelope-driven speech decoding from high-density electroencephalography (EEG) and magnetoencephalography recordings can identify the attended speech stream in a multi-speaker scenario. The present work replicated the previous high-density EEG study and investigated the technical requirements for practical attended-speech decoding with EEG. Approach. Twelve normal-hearing participants attended to one of two simultaneously presented audiobook stories while high-density EEG was recorded. An offline iterative procedure that eliminated the channels contributing least to decoding provided insight into the necessary channel number and the optimal cross-subject channel configuration. Aiming towards the future goal of near real-time classification with an individually trained decoder, the minimum duration of training data necessary for successful classification was determined using a chronological cross-validation approach. Main results. Close replication of the previously reported results confirmed the robustness of the method. Decoder performance remained stable from 96 channels down to 25. Furthermore, with less than 15 min of training data, the subject-independent (pre-trained) decoder performed better than an individually trained decoder. Significance. Our study complements previous research and suggests that efficient low-density online EEG decoding is within reach.
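The decoding approach summarized in the abstract is a backward (stimulus-reconstruction) model: a regularized linear mapping from time-lagged EEG channels to the speech envelope, with the attended speaker identified as the one whose envelope correlates best with the reconstructed envelope. The sketch below illustrates that general idea in Python; the sampling rate, lag window, regularization value, channel count, and all function names are assumptions chosen for illustration, not the parameters or code reported in the paper.

```python
import numpy as np

def lagged_eeg(eeg, n_lags):
    """Stack time-lagged copies of every channel: samples x (channels * n_lags).

    Lags 0 .. n_lags-1 samples (EEG following the stimulus in time), as used
    in backward stimulus-reconstruction models.
    """
    n_samples, n_channels = eeg.shape
    X = np.zeros((n_samples, n_channels * n_lags))
    for lag in range(n_lags):
        shifted = np.roll(eeg, -lag, axis=0)
        if lag > 0:
            shifted[-lag:, :] = 0.0          # zero the wrapped-around tail
        X[:, lag * n_channels:(lag + 1) * n_channels] = shifted
    return X

def train_backward_model(eeg, envelope, n_lags, ridge=1e3):
    """Ridge regression from lagged EEG to the attended speech envelope."""
    X = lagged_eeg(eeg, n_lags)
    XtX = X.T @ X + ridge * np.eye(X.shape[1])
    return np.linalg.solve(XtX, X.T @ envelope)   # decoder weights

def classify_attended(eeg, env_a, env_b, weights, n_lags):
    """Reconstruct the envelope and pick the speaker it correlates with best."""
    recon = lagged_eeg(eeg, n_lags) @ weights
    r_a = np.corrcoef(recon, env_a)[0, 1]
    r_b = np.corrcoef(recon, env_b)[0, 1]
    return ("A", r_a, r_b) if r_a >= r_b else ("B", r_a, r_b)

# Illustrative use: 25-channel EEG and two speech envelopes, all at 64 Hz,
# with a 0-250 ms lag window (16 samples). The arrays are random placeholders.
fs = 64
eeg_train = np.random.randn(fs * 600, 25)        # 10 min of training EEG
env_train = np.random.randn(fs * 600)            # attended-speech envelope
w = train_backward_model(eeg_train, env_train, n_lags=16)
eeg_test = np.random.randn(fs * 60, 25)
winner, r_a, r_b = classify_attended(eeg_test, np.random.randn(fs * 60),
                                     np.random.randn(fs * 60), w, n_lags=16)
```

The channel-reduction analysis described in the abstract can be layered on top of such a decoder by repeatedly retraining after removing the channel whose weights contribute least, and the subject-independent variant by training on data pooled across participants; both are mentioned here only to connect the sketch to the procedure the abstract describes.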
Pages: 9