A SOLUTION TO RESIDUAL NOISE IN SPEECH DENOISING WITH SPARSE REPRESENTATION

被引:0
作者
He, Yongjun [1 ]
Han, Jiqing [1 ]
Deng, Shiwen [1 ]
Zheng, Tieran [1 ]
Zheng, Guibin [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150006, Peoples R China
来源
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012年
关键词
Sparse representation; speech denoising; residual noise; basis pursuit denoising;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
As a promising technique, sparse representation has been extensively investigated in signal processing community. Recently, sparse representation is widely used for speech processing in noisy environments; however, many problems need to be solved because of the particularity of speech. One assumption for speech denoising with sparse representation is that the representation of speech over the dictionary is sparse, while that of the noise is dense. Unfortunately, this assumption is not sustained in speech denoising scenario. We find that many noises, e.g., the babble and white noises, are also sparse over the dictionary trained with clean speech, resulting in severe residual noise in sparse enhancement. To solve this problem, we propose a novel residual noise reduction (RNR) method which first finds out the atoms which represents the noise sparely, and then ignores them in the reconstruction of speech. Experimental results show that the proposed method can reduce residual noise substantially.
引用
收藏
页码:4653 / 4656
页数:4
相关论文
共 11 条
[1]  
[Anonymous], P ICML
[2]  
[Anonymous], 2011, Compressed Sensing: Theory and Applications
[3]   An energy budget for signaling in the grey matter of the brain [J].
Attwell, D ;
Laughlin, SB .
JOURNAL OF CEREBRAL BLOOD FLOW AND METABOLISM, 2001, 21 (10) :1133-1145
[4]  
Chen SSB, 2001, SIAM REV, V43, P129, DOI [10.1137/S003614450037906X, 10.1137/S1064827596304010]
[5]  
Durrieu L.-L., 2011, P ICASSP, P5260
[6]   Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition [J].
Gemmeke, Jort F. ;
Virtanen, Tuomas ;
Hurmalainen, Antti .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07) :2067-2080
[7]   Noise power spectral density estimation based on optimal smoothing and minimum statistics [J].
Martin, R .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (05) :504-512
[8]   Emergence of simple-cell receptive field properties by learning a sparse code for natural images [J].
Olshausen, BA ;
Field, DJ .
NATURE, 1996, 381 (6583) :607-609
[9]  
Philipos CL., 2007, SPEECH ENHANCEMENT T
[10]   SPEECH ENHANCEMENT WITH SPARSE CODING IN LEARNED DICTIONARIES [J].
Sigg, Christian D. ;
Dikk, Tomas ;
Buhmann, Joachim M. .
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :4758-4761