CLUSTERING AND SUPPRESSION OF TRANSIENT NOISE IN SPEECH SIGNALS USING DIFFUSION MAPS

Citations: 0
Authors
Talmon, Ronen [1]
Cohen, Israel [1]
Gannot, Sharon
Affiliations
[1] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Technion, Haifa, Israel
Source
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011
Keywords
Speech enhancement; speech processing; acoustic noise; impulse noise; transient noise; ENHANCEMENT;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Recently we have presented a novel approach for transient noise reduction that relies on non-local (NL) filtering. In this paper, we modify and extend our approach to support clustering and suppression of a few transient noise types simultaneously, by introducing two novel concepts. We observe that voiced speech spectral components are slowly varying compared to transient noise. Thus, by applying an algorithm for noise power spectral density (PSD) estimation, configured to track faster variations than pseudo-stationary noise, the PSD of speech components may be estimated. In addition, we utilize diffusion maps to embed the measurements into a new domain. We obtain a new representation which enables clustering of different transient noise types. The new representation is incorporated into a NL filter as a better affinity metric for averaging over transient instances. Experimental results show that the proposed algorithm enables clustering and suppression of multiple transient interferences.
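The diffusion maps embedding mentioned in the abstract can be illustrated with a minimal sketch. This is a generic textbook construction (Gaussian kernel, row-stochastic normalization, leading eigenvectors), not the authors' implementation. The function name `diffusion_map`, the parameters `eps`, `n_components`, and `t`, and the toy 2-D data are all assumptions introduced for illustration; the record does not specify the features, kernel scale, or normalization used in the paper.

```python
import numpy as np

def diffusion_map(X, eps, n_components=2, t=1):
    """Embed rows of X (n_samples, n_features) with a basic diffusion map.

    eps, n_components and t are illustrative parameters (assumptions),
    not values taken from the paper.
    """
    # Pairwise squared Euclidean distances between samples.
    sq = np.sum(X ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (X @ X.T), 0.0)
    # Gaussian affinity kernel and node degrees.
    K = np.exp(-d2 / eps)
    d = K.sum(axis=1)
    # S = D^{-1/2} K D^{-1/2} is symmetric and has the same eigenvalues
    # as the Markov matrix P = D^{-1} K.
    S = K / np.sqrt(np.outer(d, d))
    vals, vecs = np.linalg.eigh(S)
    order = np.argsort(vals)[::-1]
    vals, vecs = vals[order], vecs[:, order]
    # Right eigenvectors of P are D^{-1/2} times the eigenvectors of S.
    psi = vecs / np.sqrt(d)[:, None]
    # Skip the trivial constant eigenvector (eigenvalue 1).
    return psi[:, 1:n_components + 1] * vals[1:n_components + 1] ** t

# Toy data standing in for two transient types (hypothetical 2-D features).
rng = np.random.default_rng(0)
A = rng.normal(0.0, 0.3, size=(20, 2))
B = rng.normal(0.0, 0.3, size=(20, 2)) + np.array([3.0, 0.0])
emb = diffusion_map(np.vstack([A, B]), eps=1.0)
```

In this toy setting the first diffusion coordinate takes opposite signs on the two groups, so a simple threshold or k-means on `emb` would separate them. Distances in the embedded space could then serve as the affinity for a non-local averaging filter, in the spirit of what the abstract describes.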
Pages: 5084-5087
Page count: 4