Audible noise reduction in eigendomain for speech enhancement

Cited by: 10
Authors
You, Chang Huai [1]
Rahardja, Susanto
Koh, Soo Ngee
Affiliations
[1] Inst Infocomm Res, Singapore 119613, Singapore
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2007 / Vol. 15 / No. 06
Keywords
audible noise reduction; eigen-decomposition; masking properties; signal subspace; speech enhancement;
DOI
10.1109/TASL.2007.899288
Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
A signal subspace scheme based on masking properties is proposed for enhancement of speech degraded by additive noise. Since the masking properties are related to the critical frequency band that is derived from the characteristics of the human cochlea, the incorporation of a masking threshold into a subspace technique requires the transformation between the frequency and eigen domains. We present and apply an invertible transformation between the frequency and eigen domains. In this paper, we use masking properties of the human auditory system to define the audible noise quantity in the eigendomain. We derive the eigen-decomposition of the estimated speech autocorrelation matrix with the assumption of white noise. Subsequently, an audible noise reduction scheme is developed based on a signal subspace technique, and the implementation of our proposed scheme is outlined. We further extend the scheme to the colored noise case. Simulation results show the superiority of our proposed scheme over other existing subspace methods in terms of segmental signal-to-noise ratio (SNR), perceptual evaluation of speech quality (PESQ), modified Bark spectral distortion (MBSD), spectrogram and informal listening tests.
Pages: 1753-1765
Page count: 13