Dereverberation and denoising using multichannel linear prediction

被引:26
作者
Delcroix, Marc [1 ]
Hikichi, Takafumi
Miyoshi, Masato
机构
[1] NTT Corp, NTT Commun Sci Labs, Kyoto 6190237, Japan
[2] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo, Hokkaido 0600814, Japan
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2007年 / 15卷 / 06期
关键词
blind dereverberation; denoising; linear prediction (LP); multichannel;
D O I
10.1109/TASL.2007.899286
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Reverberation in a room severely degrades the characteristics and auditory quality of speech captured by distant microphones, thus posing a severe problem for many speech applications. Several dereverberation techniques have been proposed with a view to solving this problem. There are, however, few reports of dereverberation methods working under noisy conditions. In this paper, we propose an extension of a dereverberation algorithm based on multichannel linear prediction that achieves both the dereverberation and noise reduction of speech in an acoustic environment with a colored noise source. The method consists of two steps. First, the speech residual is estimated from the observed signals by employing multichannel linear prediction. When we use a microphone array, and assume, roughly speaking, that one of the microphones is closer to the speaker than the noise source, the speech residual is unaffected by the room reverberation or the noise. However, the residual is degraded because linear prediction removes an average of the speech characteristics. In a second step, the average of the speech characteristics is estimated and used to recover the speech. Simulations were conducted for a reverberation time of 0.5 s and an input signal-to-noise ratio of 0 dB. With the proposed method, the reverberation was suppressed by more than 20 dB and the noise level reduced to -18 dB.
引用
收藏
页码:1791 / 1801
页数:11
相关论文
共 31 条
[1]   A signal subspace tracking algorithm for microphone array processing of speech [J].
Affes, S ;
Grenier, Y .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (05) :425-437
[2]  
AICHNER R, 2006, P ICASSP, V5, P37
[3]   IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS [J].
ALLEN, JB ;
BERKLEY, DA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) :943-950
[4]  
[Anonymous], DISCRETE SIGNAL PROC
[5]  
*ATR INT, SPEECH DAT
[6]  
BOBILLET W, 2004, P ICASSP04, V2, P777
[7]  
DELCROIX M, 2006, T IEICE, P2837
[8]  
DELCROIX M, 2005, P INT 05, P2309
[9]  
DELCROIX M, 2006, P ICASSP 06, V1, P825
[10]   Precise dereverberation using multichannel linear prediction [J].
Delcroix, Marc ;
Hikichi, Takafumi ;
Miyoshi, Masato .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02) :430-440