ON USING SPECTRAL GRADIENT IN CONDITIONAL MAP CRITERION FOR ROBUST VOICE ACTIVITY DETECTION

Cited by: 0
Authors
Choi, Jae-Hun [1]
Chang, Joon-Hyuk [1]
Affiliation
[1] Hanyang Univ, Sch Elect Engn, Seoul 133791, South Korea
Source
PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC 2012) | 2012
Keywords
Voice activity detection; Spectral gradient; Conditional MAP; Likelihood ratio test
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we propose a novel approach to improving statistical model-based voice activity detection (VAD), based on a modified conditional maximum a posteriori (MAP) criterion that incorporates a spectral gradient scheme. The proposed conditional MAP criterion takes into account not only the voice activity decision in the previous frame, as in Ref. [1], but also the spectral gradient of the observed spectra between the current frame and the past frames, in order to exploit the inter-frame correlation of voice activity efficiently. As a result, the proposed VAD uses six separate thresholds in the likelihood ratio test (LRT), selected adaptively according to both the previous VAD result and the estimated spectral gradient parameter. Experimental results demonstrate that the proposed approach outperforms the previous conditional MAP-based method.
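The threshold selection described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the spectral gradient is estimated or quantized, so the three gradient classes (chosen so that 2 previous decisions × 3 classes gives six thresholds), the `margin` parameter, the threshold values, and all function names are assumptions made for illustration. The likelihood ratio uses a Gaussian-model form common in statistical model-based VAD, which is also an assumption here.

```python
import math

# Hypothetical log-domain thresholds indexed by
# (previous VAD decision, spectral gradient class).
# The values are placeholders, not taken from the paper.
THRESHOLDS = {
    (0, "falling"): 1.2,
    (0, "flat"): 1.0,
    (0, "rising"): 0.8,
    (1, "falling"): 0.6,
    (1, "flat"): 0.4,
    (1, "rising"): 0.2,
}


def mean_log(power):
    """Average of the log power over frequency bins."""
    return sum(math.log(p) for p in power) / len(power)


def gradient_class(curr_power, past_power, margin=0.1):
    """Quantize the change in mean log power into three classes.

    Assumed stand-in for the paper's spectral gradient parameter.
    """
    g = mean_log(curr_power) - mean_log(past_power)
    if g > margin:
        return "rising"
    if g < -margin:
        return "falling"
    return "flat"


def log_likelihood_ratio(snr_post, snr_prior):
    """Mean log likelihood ratio over bins under a Gaussian model.

    Per bin: gamma * xi / (1 + xi) - log(1 + xi), where gamma is the
    a posteriori SNR and xi is the a priori SNR.
    """
    terms = [
        g * x / (1.0 + x) - math.log(1.0 + x)
        for g, x in zip(snr_post, snr_prior)
    ]
    return sum(terms) / len(terms)


def vad_decision(llr, prev_decision, grad):
    """Return 1 (speech) if the LLR exceeds the selected threshold, else 0."""
    return 1 if llr > THRESHOLDS[(prev_decision, grad)] else 0
```

With these placeholder values, the same likelihood ratio can yield different decisions depending on context: a frame following speech is tested against a lower threshold than a frame following noise, which is how the conditional criterion uses inter-frame correlation.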
Pages: 370-374
Page count: 5
Related papers
13 in total
[1]  
[Anonymous], 1999, 301708 ETSI EN
[2]   Voice activity detector employing generalised Gaussian distribution [J].
Chang, JH ;
Shin, JW ;
Kim, NS .
ELECTRONICS LETTERS, 2004, 40 (24) :1561-1563
[3]   Voice activity detection based on complex Laplacian model [J].
Chang, JH ;
Kim, NS .
ELECTRONICS LETTERS, 2003, 39 (07) :632-634
[4]  
CHANG JH, 2003, P EUR GEN SWITZ, P1065
[5]   Voice activity detection based on multiple statistical models [J].
Chang, Joon-Hyuk ;
Kim, Nam Soo ;
Mitra, Sanjit K. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (06) :1965-1976
[6]  
Cho Y.D., 2001, IEEE INT C AC SPEECH, V2, P7
[7]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121
[8]  
GPP2 Spec, 2004, 3GPP2CS00140, V1.0
[9]  
ITU-T, 1996, SIL COMPR SCHEM G 72
[10]   Statistical voice activity detection using a multiple observation likelihood ratio test [J].
Ramírez, J ;
Segura, JC ;
Benítez, C ;
García, L ;
Rubio, A .
IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (10) :689-692