On Optimal Frequency-Domain Multichannel Linear Filtering for Noise Reduction

Cited by: 245
Authors
Souden, Mehrez [1]
Benesty, Jacob [1]
Affes, Sofiene [1]
Affiliations
[1] Univ Quebec, INRS, EMT, Montreal, PQ H5A 1K6, Canada
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2010 / Vol. 18 / No. 02
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC);
Keywords
Generalized sidelobe canceller (GSC); microphone arrays; minimum variance distortionless response (MVDR); noise reduction; parameterized non-causal multichannel Wiener filter; speech distortion; SPEECH ENHANCEMENT; WIENER FILTER; SUBSPACE APPROACH; COLORED NOISE; DEREVERBERATION; ALGORITHM;
DOI
10.1109/TASL.2009.2025790
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
Several contributions have been made so far to develop optimal multichannel linear filtering approaches and show their ability to reduce acoustic noise. However, there has not been a clear unifying theoretical analysis of their performance in terms of both noise reduction and speech distortion. To fill this gap, we analyze the frequency-domain (non-causal) multichannel linear filtering for noise reduction in this paper. For completeness, we consider the noise reduction constrained optimization problem that leads to the parameterized multichannel non-causal Wiener filter (PMWF). Our contribution is fivefold. First, we formally show that the minimum variance distortionless response (MVDR) filter is a particular case of the PMWF by properly formulating the constrained optimization problem of noise reduction. Second, we propose new simplified expressions for the PMWF, the MVDR, and the generalized sidelobe canceller (GSC) that depend on the signals' statistics only. In contrast to earlier works, these expressions are explicitly independent of the channel transfer function ratios. Third, we quantify the theoretical gains and losses in terms of speech distortion and noise reduction when using the PMWF by establishing new simplified closed-form expressions for three performance measures, namely, the signal distortion index, the noise reduction factor (originally proposed in the paper titled "New insights into the noise reduction Wiener filter," by J. Chen et al. (IEEE Transactions on Audio, Speech, and Language Processing, Vol. 15, no. 4, pp. 1218-1234, Jul. 2006) to analyze the single-channel time-domain Wiener filter), and the output signal-to-noise ratio (SNR). Fourth, we analyze the effects of coherent and incoherent noise in addition to the benefits of utilizing multiple microphones. Fifth, we propose a new proof for the a posteriori SNR improvement achieved by the PMWF. Finally, we provide some simulation results to corroborate the findings of this work.
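As an illustration of the abstract's first two points, the following is a minimal sketch of a PMWF computed from signal statistics only, at a single frequency bin. It assumes the closed form commonly associated with this filter, h = Phi_vv^{-1} Phi_xx u / (beta + tr(Phi_vv^{-1} Phi_xx)), which is not stated in the abstract itself. The names `solve`, `pmwf`, `phi_xx`, `phi_vv`, `beta`, and `ref` are hypothetical and chosen for this example. Under that assumption, beta = 0 should give the MVDR filter and beta = 1 the multichannel Wiener filter.

```python
def solve(a, b):
    """Solve a @ x = b for a square complex matrix a (list of rows) and a
    matrix b, using Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    # Build the augmented matrix [a | b].
    m = [list(ra) + list(rb) for ra, rb in zip(a, b)]
    for col in range(n):
        # Pick the row with the largest magnitude in this column as pivot.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        p = m[col][col]
        m[col] = [v / p for v in m[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col:
                f = m[r][col]
                m[r] = [vr - f * vc for vr, vc in zip(m[r], m[col])]
    return [row[n:] for row in m]


def pmwf(phi_xx, phi_vv, beta=1.0, ref=0):
    """Assumed PMWF closed form for one frequency bin.

    phi_xx: speech covariance (PSD) matrix, phi_vv: noise covariance matrix,
    beta: trade-off parameter, ref: index of the reference microphone.
    Only second-order statistics are used; no transfer function ratios.
    """
    n = len(phi_xx)
    a = solve(phi_vv, phi_xx)              # Phi_vv^{-1} Phi_xx
    lam = sum(a[i][i] for i in range(n))   # trace of Phi_vv^{-1} Phi_xx
    # Selecting column `ref` is the same as multiplying by the unit vector u.
    return [a[i][ref] / (beta + lam) for i in range(n)]


if __name__ == "__main__":
    # Hypothetical two-microphone scenario with a single speech source.
    g = [1 + 0j, 0.5 + 0.5j]   # transfer functions relative to microphone 0
    phi_s = 2.0                # speech power at the source
    phi_xx = [[phi_s * gi * gj.conjugate() for gj in g] for gi in g]
    phi_vv = [[1.0 + 0j, 0.2 + 0j], [0.2 + 0j, 1.5 + 0j]]
    for beta in (0.0, 1.0):
        h = pmwf(phi_xx, phi_vv, beta=beta)
        # Response of the filter to the speech component, h^H g.
        response = sum(hi.conjugate() * gi for hi, gi in zip(h, g))
        print(beta, response)
```

In this sketch the speech response h^H g is 1 for beta = 0 (no speech distortion) and falls below 1 for beta = 1, which matches the trade-off between noise reduction and speech distortion that the abstract describes. The linear solver is written out by hand only to keep the example free of dependencies; a numerical library would normally be used instead.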
Pages: 260-276
Page count: 17