Speech Enhancement by Classification of Noisy Signals Decomposed Using NMF and Wiener Filtering

被引:0
作者
Fakhry, Mahmoud [1 ]
Poorjam, Amir Hossein [1 ]
Christensen, Mads Graesboll [1 ]
机构
[1] Aalborg Univ, CREATE, Audio Anal Lab, Aalborg, Denmark
来源
2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2018年
关键词
Speech enhancement; signal decomposition; unsupervised NMF; Wiener filtering; SVM; SEPARATION EVALUATION CAMPAIGN; AUDIO SOURCE SEPARATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Supervised non-negative matrix factorization (NMF) is effective in speech enhancement through training spectral models of speech and noise signals. However, the enhancement quality reduces when the models are trained on data that is not highly relevant to a speech signal and a noise signal in a noisy observation. In this paper, we propose to train a classifier in order to overcome such poor characterization of the signals through the trained models. The main idea is to decompose the noisy observation into parts and the enhanced signal is reconstructed by combining the less-corrupted ones which are identified in the cepstral domain using the trained classifier. We apply unsupervised NMF followed by Wiener filtering for the decomposition, and use a support vector machine trained on the mel-frequency cepstral coefficients of the parts of training speech and noise signals for the classification. The results show the effectiveness of the proposed method compared with the supervised NMF.
引用
收藏
页码:16 / 20
页数:5
相关论文
共 23 条
[1]  
[Anonymous], 2013, COMPUT REV
[2]  
[Anonymous], 2009, NONNEGATIVE MATRIX T
[3]  
Benesty J., 2016, SIGNAL ENHANCEMENT V
[4]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[5]  
Deller J.R., 2000, Discrete-time Processing of Speech Signals
[6]   Audio Source Separation in Reverberant Environments Using β-Divergence-Based Nonnegative Factorization [J].
Fakhry, Mahmoud ;
Svaizer, Piergiorgio ;
Omologo, Maurizio .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (07) :1462-1476
[7]  
Fevotte C., 2010, CoRR, Vabs/1010.1763
[8]   Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay [J].
Gerkmann, Timo ;
Hendriks, Richard C. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04) :1383-1393
[9]   Subjective comparison and evaluation of speech enhancement algorithms [J].
Hu, Yi ;
Loizou, Philipos C. .
SPEECH COMMUNICATION, 2007, 49 (7-8) :588-601
[10]  
Laurberg H., 2008, COMP INT NEUROSC, V2008