Application of GM(1,1) model to voice activity detection

被引:0
作者
Cheng-Hsiung Hsieh [1 ]
Ting-Yu Feng [1 ]
机构
[1] Chaoyang Univ Technol, Dept Comp Sci & Informat Engn, Wufong 41349, Taiwan
来源
2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS | 2006年
关键词
D O I
10.1109/ICSMC.2006.384480
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a novel approach to apply GM(1,1) model in voice activity detection (VAD) is presented. The approach is termed as grey VAD (GVAD). In GVAD, the GM(1,1) model is used to estimate non-stationary noise in noisy speech and therefore signal component where an additive signal model is assumed. By estimated noise and signal, the signal-to-noise ratio (SNR) is calculated. Based on an adaptive threshold, the speech and non-speech segments are determined. The proposed GVAD is performed in the time domain and thus has less computational complexity than those frequency domain approaches. Through simulation, the GVAD is verified by cases with non-stationary noise. The result indicates that the proposed GVAD is able to detect voice activity appropriately.
引用
收藏
页码:770 / +
页数:3
相关论文
共 12 条
[1]   Voice activity detection based on complex Laplacian model [J].
Chang, JH ;
Kim, NS .
ELECTRONICS LETTERS, 2003, 39 (07) :632-634
[2]  
Chang JH, 2001, IEICE T INF SYST, VE84D, P1231
[3]  
Childers D. G., 1999, SPEECH PROCESSING SY
[4]   CONTROL-PROBLEMS OF GREY SYSTEMS [J].
DENG, JL .
SYSTEMS & CONTROL LETTERS, 1982, 1 (05) :288-294
[5]  
Deng Julong, 1989, Journal of Grey Systems, V1, P1
[6]  
ESTEVEZ PA, 2005, ELECT LETT, V41
[7]   Voice activity detection algorithm using radial basis function network [J].
Kim, HI ;
Park, SK .
ELECTRONICS LETTERS, 2004, 40 (22) :1454-1456
[8]   THEORY OF ORDER STATISTIC FILTERS AND THEIR RELATIONSHIP TO LINEAR FIR FILTERS [J].
LONGBOTHAM, HG ;
BOVIK, AC .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (02) :275-287
[9]   An effective subband OSF-based VAD with noise reduction for robust speech recognition [J].
Ramírez, J ;
Segura, JC ;
Benítez, C ;
de la Torre, A ;
Rubio, A .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (06) :1119-1129
[10]   Efficient voice activity detection algorithms using long-term speech information [J].
Ramírez, J ;
Segura, JC ;
Benítez, C ;
de la Torre, A ;
Rubio, A .
SPEECH COMMUNICATION, 2004, 42 (3-4) :271-287