Binaural noise reduction via cue-preserving MMSE filter and adaptive-blocking-based noise PSD estimation

被引:6
作者
Azarpour, Masoumeh [1 ]
Enzner, Gerald [1 ]
机构
[1] Ruhr Univ Bochum, Inst Commun Acoust, Univ Str 150, Bochum, Germany
来源
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2017年
关键词
Equalization-cancelation; Noise estimation; Cue preservation; Binaural noise reduction; Real-time listening test; SPEECH-ENHANCEMENT; THEORETICAL-ANALYSIS; HEARING-AIDS; PRESERVATION; ALGORITHMS; MODEL; COMPRESSION; DESIGN;
D O I
10.1186/s13634-017-0485-9
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Binaural noise reduction, with applications for instance in hearing aids, has been a very significant challenge. This task relates to the optimal utilization of the available microphone signals for the estimation of the ambient noise characteristics and for the optimal filtering algorithm to separate the desired speech from the noise. The additional requirements of low computational complexity and low latency further complicate the design. A particular challenge results from the desired reconstruction of binaural speech input with spatial cue preservation. The latter essentially diminishes the utility of multiple-input/single-output filter-and-sum techniques such as beamforming. In this paper, we propose a comprehensive and effective signal processing configuration with which most of the aforementioned criteria can be met suitably. This relates especially to the requirement of efficient online adaptive processing for noise estimation and optimal filtering while preserving the binaural cues. Regarding noise estimation, we consider three different architectures: interaural (ITF), cross-relation (CR), and principal-component (PCA) target blocking. An objective comparison with two other noise PSD estimation algorithms demonstrates the superiority of the blocking-based noise estimators, especially the CR-based and ITF-based blocking architectures. Moreover, we present a new noise reduction filter based on minimum mean-square error (MMSE), which belongs to the class of common gain filters, hence being rigorous in terms of spatial cue preservation but also efficient and competitive for the acoustic noise reduction task. A formal real-time subjective listening test procedure is also developed in this paper. The proposed listening test enables a real-time assessment of the proposed computationally efficient noise reduction algorithms in a realistic acoustic environment, e.g., considering time-varying room impulse responses and the Lombard effect. The listening test outcome reveals that the signals processed by the blocking-based algorithms are significantly preferred over the noisy signal in terms of instantaneous noise attenuation. Furthermore, the listening test data analysis confirms the conclusions drawn based on the objective evaluation.
引用
收藏
页数:17
相关论文
共 85 条
  • [1] SHORT-TERM SPECTRAL ANALYSIS, SYNTHESIS, AND MODIFICATION BY DISCRETE FOURIER-TRANSFORM
    ALLEN, JB
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1977, 25 (03): : 235 - 238
  • [2] [Anonymous], P INT WORKSH AC SIGN
  • [3] [Anonymous], 2008, Springer handbook of speech processing
  • [4] [Anonymous], 2005, P INT WORKSH AC SIGN
  • [5] [Anonymous], INT J COMPUT SCI ISS
  • [6] [Anonymous], IHCON
  • [7] [Anonymous], 2011, IEEE SIGNAL PROC LET
  • [8] [Anonymous], REC P 835 SUBJ TEST
  • [9] [Anonymous], 2000, DOCU MENT ITU T REC
  • [10] [Anonymous], P INT C AC SPEECH SI