ON USING SPECTRAL GRADIENT IN CONDITIONAL MAP CRITERION FOR ROBUST VOICE ACTIVITY DETECTION

被引:0
作者
Choi, Jae-Hun [1 ]
Chang, Joon-Hyuk [1 ]
机构
[1] Hanyang Univ, Sch Elect Engn, Seoul 133791, South Korea
来源
PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC 2012) | 2012年
关键词
Voice activity detection; Spectral gradient; Conditional MAP; Likelihood ratio test;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a novel approach to improve a statistical model-based voice activity detection (VAD) method based on a modified conditional maximum a posteriori (MAP) criterion incorporating the spectral gradient scheme. The proposed conditional MAP incorporates not only the voice activity decision in the previous frame as in Ref. [1] but also the spectral gradient of the observed spectra between the current frame and the past frames to efficiently exploit the inter-frame correlation of voice activity. As a result, the proposed VAD leads to six separate thresholds to be adaptively determined in the likelihood ratio test (LRT) depending on both the previous VAD result and the estimated spectral gradient parameter. Experimental results demonstrate that the proposed approach yields better results compared to those of the previous conditional MAP-based method.
引用
收藏
页码:370 / 374
页数:5
相关论文
共 50 条
[31]   Robust voice activity detection using higher-order statistics in the LPC residual domain [J].
Nemer, E ;
Goubran, R ;
Mahmoud, S .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (03) :217-231
[32]   ROBUST VOICE ACTIVITY DETECTION USING A MASKED AUDITORY ENCODER BASED CONVOLUTIONAL NEURAL NETWORK [J].
Li, Nan ;
Wang, Longbiao ;
Unoki, Masashi ;
Li, Sheng ;
Wang, Rui ;
Ge, Meng ;
Dang, Jianwu .
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :6828-6832
[33]   Efficient voice activity detection algorithm using long-term spectral flatness measure [J].
Yanna Ma ;
Akinori Nishihara .
EURASIP Journal on Audio, Speech, and Music Processing, 2013
[34]   Voice activity detection using neural network [J].
Ikedo, J .
IEICE TRANSACTIONS ON COMMUNICATIONS, 1998, E81B (12) :2509-2513
[35]   VOICE ACTIVITY DETECTION USING A PERIODICITY MEASURE [J].
TUCKER, R .
IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1992, 139 (04) :377-380
[36]   VOICE ACTIVITY DETECTION USING SUBBAND NONCIRCULARITY [J].
Wisdom, Scott ;
Okopal, Greg ;
Atlas, Les ;
Pitton, James .
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, :4505-4509
[37]   Robust Voice Activity Detection Against Non Homogeneous Noisy Environments [J].
Chelloug, Charaf Eddine ;
Farrouki, Atef .
2018 INTERNATIONAL CONFERENCE ON SIGNAL, IMAGE, VISION AND THEIR APPLICATIONS (SIVA), 2018,
[38]   A New Robust Voice Activity Detection method based on Genetic Algorithm [J].
Farsinejad, M. ;
Analoui, M. .
ATNAC: 2008 AUSTRALASIAN TELECOMMUNICATION NETWOKS AND APPLICATIONS CONFERENCE, 2008, :80-84
[39]   Robust Voice Activity Detection Based on Complementary BLSTM Enhancement Stage [J].
Shahryary, Iman ;
Seyedin, Sanaz ;
Ahadi, Seyed Mohammad .
2020 28TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2020, :1608-1612
[40]   Noise Robust Voice Activity Detection Based on Switching Kalman Filter [J].
Fujimoto, Masakiyo ;
Ishizuka, Kentaro .
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, :965-968