Reliable likelihood ratios for statistical model-based voice activity detector with low false-alarm rate

被引:0
作者
Younggwan Kim
Youngjoo Suh
Hoirin Kim
机构
[1] Korea Advanced Institute of Science and Technology,Department of Electrical Engineering
来源
EURASIP Journal on Advances in Signal Processing | / 2011卷
关键词
voice activity detector; statistical model; reliability of likelihood ratio;
D O I
暂无
中图分类号
学科分类号
摘要
The role of the statistical model-based voice activity detector (SMVAD) is to detect speech regions from input signals using the statistical models of noise and noisy speech. The decision rule of SMVAD is based on the likelihood ratio test (LRT). The LRT-based decision rule may cause detection errors because of statistical properties of noise and speech signals. In this article, we first analyze the reasons why the detection errors occur and then propose two modified decision rules using reliable likelihood ratios (LRs). We also propose an effective weighting scheme considering spectral characteristics of noise and speech signals. In the experiments proposed in this study, with almost no additional computations, the proposed methods show significant performance improvement in various noise conditions. Experimental results also show that the proposed weighting scheme provides additional performance improvement over the two proposed SMVADs.
引用
收藏
相关论文
共 40 条
[1]  
Sohn J(1999)A statistical model-based voice activity detection IEEE Signal Process Lett 6 1-3
[2]  
Kim NS(2001)Analysis and improvement of a statistical model-based voice activity detector IEEE Signal Process Lett 8 276-278
[3]  
Sung W(2006)Voice activity detection based on multiple statistical models IEEE Trans Signal Process 54 1965-1976
[4]  
Cho YD(2008)Discriminative weight training for a statistical model-based voice activity detection IEEE Signal Process Lett 15 170-173
[5]  
Kondoz A(2005)Statistical voice activity detection using a multiple observation likelihood ratio test IEEE Signal Process Lett 12 689-692
[6]  
Chang J-H(2004)A new Kullback-Leibler VAD for speech recognition in noise IEEE Signal Process Lett 11 266-269
[7]  
Kim NS(2006)Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold IEEE Trans Audio Speech Lang Process 14 412-424
[8]  
Mitra SK(2008)Voice activity detection based on conditional MAP criterion IEEE Signal Process Lett 15 257-260
[9]  
Kang S-I(2006)Speech/non-speech discrimination based on contextual information integrated bispectrum LRT IEEE Signal Process Lett 13 497-500
[10]  
Jo Q-H(2004)Voice activity detector employing generalised Gaussian distribution Electron Lett 40 1561-1563