SNMF Based Speech Denoising with Wavelet Decomposed Signal Selection

被引:0
作者
Varshney, Yash Vardhan [1 ]
Abbasi, Z. A. [1 ]
Abidi, M. R. [1 ]
Farooq, Omar [1 ]
Upadhyaya, Prashant [1 ]
机构
[1] Aligarh Muslim Univ, Dept Elect Engn, Aligarh, Uttar Pradesh, India
来源
2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET) | 2017年
关键词
Non-negative matrix factorization; discrete wavelet transform; monaural source separation; NONNEGATIVE MATRIX FACTORIZATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposed the model of non-negative matrix factorization (NMF) with the effect of digital wavelet decomposition in speech denoising. Sparse NMF has been used over magnitude spectrogram of speech signal to find the basis vectors of training and weights of test signal. The results are validating the effect of wavelet decomposition on the performance. To test the algorithm, TIMIT data for speech signal database and Noisex92 data for noise database was used. The performance measurement has been taken in terms of signal to distortion ratio (SDR), signal to artifacts ratio (SAR), signal to interference ratio (SIR), perceptual evaluation of speech quality (PESQ) and short-time objective intelligibility measure (STOi). Here, the separation of speech signal from the noisy signal has been performed with and without prior knowledge of noise. Results are compared with the existing algorithms. Proposed model has shown improvements over the existing models in both conditions.
引用
收藏
页码:2603 / 2606
页数:4
相关论文
共 22 条
[1]  
Akansu AN, 2001, Multiresolution Signal Decomposition: Transforms, Subbands, and Wavelets
[2]  
[Anonymous], ELECT LETT
[3]  
[Anonymous], 2005, Tech. Rep. 1706
[4]  
Bavkar S, 2013, 2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), P1103, DOI 10.1109/ICACCI.2013.6637331
[5]   Noise Perturbation Improves Supervised Speech Separation [J].
Chen, Jitong ;
Wang, Yuxuan ;
Wang, DeLiang .
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, LVA/ICA 2015, 2015, 9237 :83-90
[6]   Monaural speech separation and recognition challenge [J].
Cooke, Martin ;
Hershey, John R. ;
Rennie, Steven J. .
COMPUTER SPEECH AND LANGUAGE, 2010, 24 (01) :1-15
[7]   Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis [J].
Fevotte, Cedric ;
Bertin, Nancy ;
Durrieu, Jean-Louis .
NEURAL COMPUTATION, 2009, 21 (03) :793-830
[8]  
Hoyer PO, 2004, J MACH LEARN RES, V5, P1457
[9]   Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation [J].
Huang, Po-Sen ;
Kim, Minje ;
Hasegawa-Johnson, Mark ;
Smaragdis, Paris .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (12) :2136-2147
[10]  
ITU, PERC EV SPEECH QUAL, V862, P862