Signal Subspace-based Voice Activity Detection Using Generalized Gaussian Distribution

被引:1
作者
Um, Yong-Sub
Chang, Joon-Hyuk
Kim, Dong Kook
机构
来源
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA | 2013年 / 32卷 / 02期
关键词
Voice activity detection; Signal subspace; Generalized gaussian distribution;
D O I
10.7776/ASK.2013.32.2.131
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we propose an improved voice activity detection (VAD) algorithm using statistical models in the signal subspace domain. A uncorrelated signal subspace is generated using embedded prewhitening technique and the statistical characteristics of the noisy speech and noise are investigated in this domain. According to the characteristics of the signals in the signal subspace, a new statistical VAD method using GGD (Generalized Gaussian Distribution) is proposed. Experimental results show that the proposed GGD-based approach outperforms the Gaussian-based signal subspace method at 0-15 dB SNR simulation conditions.
引用
收藏
页码:131 / 137
页数:7
相关论文
共 12 条
[1]   Voice activity detector employing generalised Gaussian distribution [J].
Chang, JH ;
Shin, JW ;
Kim, NS .
ELECTRONICS LETTERS, 2004, 40 (24) :1561-1563
[2]   Voice activity detection based on multiple statistical models [J].
Chang, Joon-Hyuk ;
Kim, Nam Soo ;
Mitra, Sanjit K. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (06) :1965-1976
[3]  
Cho YD, 2001, IEEE SIGNAL PROC LET, V8, P276, DOI 10.1109/97.957270
[4]   A soft voice activity detector based on a Laplacian-Gaussian model [J].
Gazor, S ;
Zhang, W .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05) :498-505
[5]   Order statistics in goodness-of-fit testing [J].
Glen, AG ;
Leemis, LM ;
Barr, DR .
IEEE TRANSACTIONS ON RELIABILITY, 2001, 50 (02) :209-213
[6]   A generalized subspace approach for enhancing speech corrupted by colored noise [J].
Hu, Y ;
Loizou, PC .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (04) :334-341
[7]   A subspace approach based on embedded prewhitening for voice activity detection [J].
Kim, Dong Kook ;
Chang, Joon-Hyuk .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 130 (05) :EL304-EL310
[8]   A generalized normal distribution [J].
Nadarajah, S .
JOURNAL OF APPLIED STATISTICS, 2005, 32 (07) :685-694
[9]  
Ryu KC, 2008, J ACOUST SOC KOREA, V27, P372
[10]   Statistical modeling of speech signals based on generalized gamma distribution [J].
Shin, JW ;
Chang, JH ;
Kim, NS .
IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (03) :258-261