Silence and speech segmentation for noisy speech using a wavelet based algorithm

被引:0
作者
Mei, XD [1 ]
Sun, SH [1 ]
机构
[1] Harbin Inst Technol, Dept Automat Test & Control, Harbin 150001, Peoples R China
来源
CHINESE JOURNAL OF ELECTRONICS | 2001年 / 10卷 / 04期
关键词
speech segmentation; wavelet transform; cross-correlation; low-energy phoneme;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present a new method to segment silence and speech for noisy condition. Conventional segmentation methods usually lack in robustness under high background noise because they are mostly dependent on amplitude or energy of speech signal. For speech signal, the correlation between the neighbor frequency bands including most speech energy is high and little effected by noise, but the correlation between the neighbor frequency bands of noise is low. So we employ the crosscorrelation of neighbor sub-bands of signal to locate speech and noise. We first performed the wavelet transform to denoise and further calculated the crosscorrelation between the wavelet coefficients in two selected sub-bands and then used the standard deviation of cross-correlation coefficients to segment speech and noise duration. The simulation and the result analysis show that this method is efficient for the low-energy phonemes even in low signal-to-noise ratio, and the amount of computation is less.
引用
收藏
页码:439 / 443
页数:5
相关论文
共 8 条
[1]  
AMIN N, 1999, EUROSPEECH 99, V2, P903
[2]  
BURLEY S, 1997, P IEEE INT C AC SPEE, P3417
[3]   The discrete multiple wavelet transform and thresholding methods [J].
Downie, TR ;
Silverman, BW .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1998, 46 (09) :2558-2561
[4]  
KARRAY L, 1998, P ICSLP 99 DEC, P1471
[5]  
KARRAY L, 1999, EUROSPEECH 99 BUD HU, V5, P2379
[6]  
KARRAY L, 1999, J BEIJING U POSTS TE, V22, P31
[7]   A THEORY FOR MULTIRESOLUTION SIGNAL DECOMPOSITION - THE WAVELET REPRESENTATION [J].
MALLAT, SG .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1989, 11 (07) :674-693
[8]  
Strand G., 1996, WAVELETS FILTER BANK