A robust voice activity detector for wireless communications using soft computing

被引:69
作者
Beritelli, F [1 ]
Casale, S [1 ]
Cavallaro, A [1 ]
机构
[1] Univ Catania, Ist Informat & Telecomunicaz, I-95125 Catania, Italy
关键词
fuzzy logic (FL); mobile communication; pattern recognition; speech coding; voice activity detection;
D O I
10.1109/49.737650
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Discontinuous transmission based on speech/pause detection represents a valid solution to improve the spectral efficiency of new-generation wireless communication systems. In this contest, robust voice activity detection (VAD) algorithms are required, as traditional solutions present a high misclassification rate in the presence of the background noise typical of mobile environments, This paper presents a voice detection algorithm which is robust to noisy environments, thanks to a new methodology adopted for the matching process. More specifically, the VAD proposed is based on a pattern recognition approach in which the matching phase is performed by a set of sis fuzzy rules, trained by means of a new hybrid learning tool. A series of objective tests performed on a large speech database, varying the signal-to-noise ratio (SNR), the types of background noise, and the input signal level, showed that, as compared with the VAD recently standardized by ITU-T in Recommendation G.729 annex B, the fuzzy VAD, on average, achieves an improvement in reduction both of the activity factor of about 25% and of the clipping introduced of about 43%. Informal listening tests also confirm an improvement in the perceived speech quality.
引用
收藏
页码:1818 / 1829
页数:12
相关论文
共 28 条
[1]  
BARRET P, 1997, 29EWP316 ITUT
[2]   ITU-T recommendation G.729 Annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications [J].
Benyassine, A ;
Shlomot, E ;
Su, HY ;
Massaloux, D ;
Lamblin, C ;
Petit, JP .
IEEE COMMUNICATIONS MAGAZINE, 1997, 35 (09) :64-73
[3]  
Beritelli F., 1995, Proceedings of ISUMA - NAFIPS '95 The Third International Symposium on Uncertainty Modeling and Analysis and Annual Conference of the North American Fuzzy Information Processing Society (Cat. No.95TB8082), P589, DOI 10.1109/ISUMA.1995.527761
[4]  
BERITELLI F, 1997, P EUR S INT TECHN ES, P91
[5]  
BERITELLI F, 1995, P EUROSPEECH 9K, P389
[6]  
BERITELLI F, 1997, P IEEE WORKSH SPEECH, P5
[7]  
BERITELLI F, 1997, THESIS U CATANIA CAT
[8]  
BERITELLI F, 1997, AHQ196 ITUT
[9]  
BERITELLI F, 1997, WP312 ITUT
[10]  
BERITELLI F, 1995, P IEEE WORKSH SPEECH, P97