An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones

被引:0
作者
Randall Ali
Toon van Waterschoot
Marc Moonen
机构
[1] KU Leuven,
[2] Department of Electrical Engineering (ESAT),undefined
[3] STADIUS Center for Dynamical Systems,undefined
[4] Signal Processing,undefined
[5] and Data Analytics,undefined
来源
EURASIP Journal on Audio, Speech, and Music Processing | / 2021卷
关键词
Speech enhancement; Beamforming; Minimum variance distortionless response (MVDR) beamformer; External microphones;
D O I
暂无
中图分类号
学科分类号
摘要
An integrated version of the minimum variance distortionless response (MVDR) beamformer for speech enhancement using a microphone array has been recently developed, which merges the benefits of imposing constraints defined from both a relative transfer function (RTF) vector based on a priori knowledge and an RTF vector based on a data-dependent estimate. In this paper, the integrated MVDR beamformer is extended for use with a microphone configuration where a microphone array, local to a speech processing device, has access to the signals from multiple external microphones (XMs) randomly located in the acoustic environment. The integrated MVDR beamformer is reformulated as a quadratically constrained quadratic program (QCQP) with two constraints, one of which is related to the maximum tolerable speech distortion for the imposition of the a priori RTF vector and the other related to the maximum tolerable speech distortion for the imposition of the data-dependent RTF vector. An analysis of how these maximum tolerable speech distortions affect the behaviour of the QCQP is presented, followed by the discussion of a general tuning framework. The integrated MVDR beamformer is then evaluated with audio recordings from behind-the-ear hearing aid microphones and three XMs for a single desired speech source in a noisy environment. In comparison to relying solely on an a priori RTF vector or a data-dependent RTF vector, the results demonstrate that the integrated MVDR beamformer can be tuned to yield different enhanced speech signals, which may be more suitable for improving speech intelligibility despite changes in the desired speech source position and imperfectly estimated spatial correlation matrices.
引用
收藏
相关论文
共 71 条
[1]  
Gannot S.(2017)A consolidated perspective on multi-microphone speech enhancement and source separation IEEE/ACM Trans. Audio Speech Lang. Process. 25 692-730
[2]  
Vincent E.(2016)Binaural noise cue preservation in a binaural noise reduction system with a remote microphone signal IEEE/ACM Trans. Audio Speech Lang. Process. 24 952-966
[3]  
Markovich-Golan S.(2013)Speech enhancement using a remote wireless microphone IEEE Trans. Consum. Electron. 59 167-174
[4]  
Ozerov A.(2017)A noise reduction post-filter for binaurally-linked single-microphone hearing aids utilizing a nearby external microphone IEEE/ACM Trans. Audio Speech Lang. Process. 26 5-18
[5]  
Szurley J.(2019)Methods of extending a generalized sidelobe canceller with external microphones IEEE/ACM Trans. Audio Speech Lang. Process. 27 1349-1364
[6]  
Bertrand A.(2009)Robust distributed noise reduction in hearing aids with external acoustic sensor nodes EURASIP J. Adv. Signal Process. 2009 1-14
[7]  
van Dijk B.(2013)Distributed multiple constraints generalized sidelobe canceler for fully connected wireless acoustic sensor networks IEEE Trans. Audio Speech Lang. Process. 21 343-356
[8]  
Moonen M.(1969)High-resolution frequency-wavenumber spectrum analysis Proc. IEEE 57 1408-1418
[9]  
Cvijanovic N.(2001)Signal enhancement using beamforming and nonstationarity with applications to speech IEEE Trans. Signal Process. 49 1614-1626
[10]  
Sadiq O.(1992)Evaluation of an adaptive beamforming method for hearing aids J. Acoust. Soc. Amer. 91 1662-1676