Speech enhancement via Mel-scale Wiener filtering with a frequency-wise voice activity detector

Cited by: 0
Authors
Hwa Soo Kim
Young Man Cho
Han-Jun Kim
Affiliations
[1] Finetec Century, School of Mechanical and Aerospace Engineering
[2] Seoul National University
Source
Journal of Mechanical Science and Technology | 2007 / Vol. 21
Keywords
Speech enhancement; Wiener filtering; Mel-scale; Voice activity detector
DOI
Not available
Abstract
This paper presents a speech enhancement system that enables comfortable communication inside an automobile. Two novel concepts are proposed to improve the two major building blocks of existing speech enhancement systems: the voice activity detector (VAD) and the noise filtering algorithm. The proposed VAD classifies a given data frame as speech or noise at each frequency, which enables frequency-wise updates of the noise statistics and thereby makes the noise filtering algorithm more effective by supplying more up-to-date noise statistics. The well-known Wiener filter is adopted as the accompanying noise filtering algorithm and yields significant noise suppression. However, the musical noise present in most Wiener filter-based systems motivates applying the Wiener filter on the Mel scale, on which the human auditory system responds to external stimulation. The Mel-scale Wiener filter turns out to create masking effects that reduce musical noise significantly, leading to smooth transitions between data frames.
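The processing chain outlined in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' algorithm: the abstract does not specify the VAD decision rule, the noise update, or the Mel filter design, so the energy-ratio `threshold`, the smoothing factor `alpha`, the triangular filterbank, the `gain_floor`, and all function names below are hypothetical choices.

```python
import numpy as np

def hz_to_mel(f):
    # Common Mel-scale formula (one of several conventions in use).
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_bands, n_bins, sample_rate):
    """Triangular Mel filters, shape (n_bands, n_bins); each row sums to 1."""
    freqs = np.linspace(0.0, sample_rate / 2.0, n_bins)
    edges = mel_to_hz(np.linspace(0.0, hz_to_mel(sample_rate / 2.0), n_bands + 2))
    bank = np.zeros((n_bands, n_bins))
    for b in range(n_bands):
        lo, mid, hi = edges[b], edges[b + 1], edges[b + 2]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        bank[b] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank / np.maximum(bank.sum(axis=1, keepdims=True), 1e-12)

def enhance_frame(power, noise_psd, bank, threshold=2.0, alpha=0.9, gain_floor=0.1):
    """Return (per-bin gain, updated noise estimate) for one power-spectrum frame."""
    # Assumed frequency-wise VAD: a bin counts as speech when its power
    # exceeds `threshold` times the current noise estimate at that bin.
    is_speech = power > threshold * noise_psd
    # Update the noise statistics only at bins classified as noise.
    noise_psd = np.where(is_speech, noise_psd,
                         alpha * noise_psd + (1.0 - alpha) * power)
    # Pool noisy power and noise power onto Mel bands.
    band_power = bank @ power
    band_noise = bank @ noise_psd
    # Wiener-style gain per band: estimated speech power / noisy power.
    band_gain = np.clip(1.0 - band_noise / np.maximum(band_power, 1e-12),
                        gain_floor, 1.0)
    # Spread the band gains back to bins as a weighted average of the
    # bands covering each bin; uncovered bins fall back to the gain floor.
    weights = bank / np.maximum(bank.sum(axis=0, keepdims=True), 1e-12)
    gain = np.clip(weights.T @ band_gain, gain_floor, 1.0)
    return gain, noise_psd
```

In use, `power` would be the squared magnitude of one short-time Fourier transform frame, and the returned gain would multiply that frame's complex spectrum before resynthesis. Computing the gain per Mel band rather than per bin smooths it across neighbouring frequencies, which is one plausible reading of how a Mel-scale filter could reduce the isolated spectral peaks heard as musical noise.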