IMPROVEMENT OF SPEECH SOURCE LOCALIZATION IN NOISY ENVIRONMENT USING OVERCOMPLETE RATIONAL-DILATION WAVELET TRANSFORMS

被引:4
作者
Liu, Di [1 ]
Khong, Andy W. H. [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
来源
2010 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW 2010) | 2010年
关键词
denoising; wavelet; speech source localization; DOA estimation; SHRINKAGE;
D O I
10.1109/CW.2010.69
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The generalized cross-correlation using the phase transform prefilter remains popular for the estimation of time-differences-of-arrival. However it is not robust to noise and as a consequence, the performance of direction-of-arrival algorithms is often degraded under low signal-to-noise condition. We propose to address this problem through the use of a wavelet-based speech enhancement technique since the wavelet transform can achieve good denoising performance. The overcomplete rational-dilation wavelet transform is then exploited to effectively process speech signals due to its higher frequency resolution. In addition, we exploit the joint distribution of the speech in the wavelet domain and develop a novel local noise variance estimator based on the bivariate shrinkage function. As will be shown, our proposed algorithm achieves good direction-of-arrival performance in the presence of noise.
引用
收藏
页码:77 / 81
页数:5
相关论文
共 10 条
[1]  
[Anonymous], P IEEE INT C AC SPEE
[2]   Frequency-Domain Design of Overcomplete Rational-Dilation Wavelet Transforms [J].
Bayram, Ilker ;
Selesnick, Ivan W. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (08) :2957-2972
[3]   Wavelet-based signal de-noising via simple singularities approximation [J].
Bruni, V ;
Vitulano, D .
SIGNAL PROCESSING, 2006, 86 (04) :859-876
[4]  
BURRUS CS, 1997, PRENTICE HALL
[5]   IDEAL SPATIAL ADAPTATION BY WAVELET SHRINKAGE [J].
DONOHO, DL ;
JOHNSTONE, IM .
BIOMETRIKA, 1994, 81 (03) :425-455
[6]   GENERALIZED CORRELATION METHOD FOR ESTIMATION OF TIME-DELAY [J].
KNAPP, CH ;
CARTER, GC .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (04) :320-327
[7]   Noise power spectral density estimation based on optimal smoothing and minimum statistics [J].
Martin, R .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (05) :504-512
[8]   Image denoising using derotated complex wavelet coefficients [J].
Miller, Mark ;
Kingsbury, Nick .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2008, 17 (09) :1500-1511
[9]   Bivariate shrinkage functions for wavelet-based denoising exploiting interscale dependency [J].
Sendur, L ;
Selesnick, IW .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (11) :2744-2756
[10]  
Zhang C, 2008, INT CONF ACOUST SPEE, P2565