Cognitive-Driven Binaural Beamforming Using EEG-Based Auditory Attention Decoding

被引：32

作者：

Aroudi, Ali ^{[1
,2
]}

Doclo, Simon ^{[1
,2
]}

机构：

[1] Carl von Ossietzky Univ Oldenburg, Dept Med Phys & Acoust, D-26129 Oldenburg, Germany

[2] Carl von Ossietzky Univ Oldenburg, Cluster Excellence Hearing4all, D-26129 Oldenburg, Germany

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2020年 / 28卷

关键词：

Auditory attention decoding (AAD); steerable binaural beamformer; speech enhancement; direction-of-arrival estimation; EEG signal; brain computer interface; SPEECH ENHANCEMENT; NOISE-REDUCTION; MULTICHANNEL; ENVIRONMENT; ALGORITHMS; TRACKING;

D O I：

10.1109/TASLP.2020.2969779

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Identifying the target speaker in hearing aid applications is an essential ingredient to improve speech intelligibility. Recently, a least-squares-based auditory attention decoding (AAD) method has been proposed to identify the target speaker from single-trial EEG recordings in an acoustic scenario with two competing speakers. Aiming at enhancing the target speaker and suppressing the interfering speaker and ambient noise, in this article, we propose a cognitive-driven speech enhancement system, consisting of a binaural beamformer which is steered based on AAD and estimated relative transfer function (RTF) vectors, which require estimates of the direction-of-arrivals (DOAs) of both speakers. For binaural beamforming and to generate reference signals for AAD, we consider either minimum-variance-distortionless-response (MVDR) beamformers or linearly-constrained-minimum-variance (LCMV) beamformers. Contrary to the binaural MVDR beamformer, the binaural LCMV beamformer allows to preserve the spatial impression of the acoustic scene and to control the suppression of the interfering speaker, which is important when intending to switch attention between speakers. The speech enhancement performance of the proposed system is evaluated in terms of the binaural signal-to-interference-plus-noise ratio ($\text {SINR}$) improvement in anechoic and reverberant conditions. Furthermore, we investigate the impact of RTF and DOA estimation errors and AAD errors on the speech enhancement performance. The experimental results show that the proposed system using LCMV beamformers yields a larger decoding performance and binaural $\text {SINR}$ improvement compared to using MVDR beamformers.

引用

页码：862 / 875

页数：14

共 44 条

[1]

[Anonymous], 2011, IEEE Signal Process. Lett

[2]

[Anonymous], 2018, The Journal of the Acoustical Society of America

[3]

[Anonymous], 2017, bioRxiv

[4]

Aroudi A, 2019, INT CONF ACOUST SPEE, P406, DOI [10.1109/ICASSP.2019.8683635, 10.1109/icassp.2019.8683635]

[5] Impact of Different Acoustic Components on EEG-Based Auditory Attention Decoding in Noisy and Reverberant Conditions [J].

Aroudi, Ali ;

Mirkovic, Bojana ;

De Vos, Maarten ;

Doclo, Simon .

IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2019, 27 (04) :652-663

[6]

Aroudi A, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P851, DOI 10.1109/ICASSP.2018.8462278

[7]

Aroudi A, 2017, IEEE SYS MAN CYBERN, P3042, DOI 10.1109/SMC.2017.8123092

[8]

Aroudi A, 2017, IEEE ENG MED BIO, P484, DOI 10.1109/EMBC.2017.8036867

[9] Auditory-Inspired Speech Envelope Extraction Methods for Improved EEG-Based Auditory Attention Detection in a Cocktail Party Scenario [J].

Biesmans, Wouter ;

Das, Neetha ;

Francart, Tom ;

Bertrand, Alexander .

IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2017, 25 (05) :402-412

[10]

Bitzer J, 2001, DIGITAL SIGNAL PROC, P19

← 1 2 3 4 5 →