EEG-Informed Attended Speaker Extraction From Recorded Speech Mixtures With Application in Neuro-Steered Hearing Prostheses

被引:92
作者
Van Eyndhoven, Simon [1 ,2 ]
Francart, Tom [3 ]
Bertrand, Alexander [1 ,2 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn ESAT, Stadius Ctr Dynam Syst Signal Proc & Data Analyt, Kasteelpk Arenberg 10,Box 2446, B-3001 Leuven, Belgium
[2] iMinds Med Informat Technol, Leuven, Belgium
[3] Katholieke Univ Leuven, Res Grp Expt Otorhinolaryngol, Dept Neurosci, Leuven, Belgium
关键词
Auditory attention detection (AAD); auditory prostheses; blind source separation (BSS); brain-computer interface; EEG signal processing; multichannel Wiener filter (MWF); speech enhancement; DAILY-LIFE APPLICATIONS; COCKTAIL PARTY; NOISE-REDUCTION; ENVIRONMENT; ALGORITHMS; SINGLE;
D O I
10.1109/TBME.2016.2587382
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective: We aim to extract and denoise the attended speaker in a noisy two-speaker acoustic scenario, relying on microphone array recordings from a binaural hearing aid, which are complemented with electroencephalography (EEG) recordings to infer the speaker of interest. Methods: In this study, we propose a modular processing flow that first extracts the two speech envelopes from the microphone recordings, then selects the attended speech envelope based on the EEG, and finally uses this envelope to inform a multichannel speech separation and denoising algorithm. Results: Strong suppression of interfering (unattended) speech and background noise is achieved, while the attended speech is preserved. Furthermore, EEG-based auditory attention detection (AAD) is shown to be robust to the use of noisy speech signals. Conclusions: Our results show that AADbased speaker extraction from microphone array recordings is feasible and robust, even in noisy acoustic environments, and without access to the clean speech signals to perform EEG-based AAD. Significance: Current research on AAD always assumes the availability of the clean speech signals, which limits the applicability in real settings. We have extended this research to detect the attended speaker even when only microphone recordings with noisy speech mixtures are available. This is an enabling ingredient for new brain-computer interfaces and effective filtering schemes in neuro-steered hearing prostheses. Here, we provide a first proof of concept for EEG-informed attended speaker extraction and denoising.
引用
收藏
页码:1045 / 1056
页数:12
相关论文
共 27 条
[1]   Human cortical responses to the speech envelope [J].
Aiken, Steven J. ;
Picton, Terence W. .
EAR AND HEARING, 2008, 29 (02) :139-157
[2]   Robust decoding of selective auditory attention from MEG in a competing-speaker environment via state-space modeling [J].
Akram, Sahar ;
Presacco, Alessandro ;
Simon, Jonathan Z. ;
Shamma, Shihab A. ;
Babadi, Behtash .
NEUROIMAGE, 2016, 124 :906-917
[3]  
Aroudi A, 2016, INT CONF ACOUST SPEE, P694, DOI 10.1109/ICASSP.2016.7471764
[4]  
Bertrand A., 2010, INT WORKSH AC ECH NO
[5]   Distributed Signal Processing for Wireless EEG Sensor Networks [J].
Bertrand, Alexander .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2015, 23 (06) :923-935
[6]   ENERGY-BASED MULTI-SPEAKER VOICE ACTIVITY DETECTION WITH AN AD HOC MICROPHONE ARRAY [J].
Bertrand, Alexander ;
Moonen, Marc .
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :85-88
[7]   Blind separation of non-negative source signals using multiplicative updates and subspace projection [J].
Bertrand, Alexander ;
Moonen, Marc .
SIGNAL PROCESSING, 2010, 90 (10) :2877-2890
[8]  
Biesmans W., 2016, IEEE T NEUR SYS REH, P1
[9]   Exploring miniaturized EEG electrodes for brain-computer interfaces. An EEG you do not see? [J].
Bleichner, Martin G. ;
Lundbeck, Micha ;
Selisky, Matthias ;
Minow, Falk ;
Jaeger, Manuela ;
Emkes, Reiner ;
Debener, Stefan ;
De Vos, Maarten .
PHYSIOLOGICAL REPORTS, 2015, 3 (04)
[10]   Wearable Electroencephalography What Is It, Why Is It Needed, and What Does It Entail? [J].
Casson, Alexander J. ;
Yates, David C. ;
Smith, Shelagh J. M. ;
Duncan, John S. ;
Rodriguez-Villegas, Esther .
IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2010, 29 (03) :44-56