Sparse Representations for Single Channel Speech Enhancement Based on Voiced/Unvoiced Classification

被引:0
作者
Mohamed Anouar Ben Messaoud
Aïcha Bouzid
机构
[1] University of Tunis El Manar,Electrical Engineering Department, National School of Engineers of Tunis
来源
Circuits, Systems, and Signal Processing | 2017年 / 36卷
关键词
Speech enhancement; Multi-scale product analysis; Sparse matrix; Principal components analysis;
D O I
暂无
中图分类号
学科分类号
摘要
The approach presented here in relies on a new voicing decision algorithm based on the multi-scale product (MP) characteristics. The MP is based on the multiplication of Wavelet Transform Coefficients at some scales. According to the voicing decision, improved subspace decomposition is operated on the voiced segments of the noisy speech signal and a multi-scale principal component analysis is applied on the unvoiced segments of the same signal. Furthermore, the voiced frames are decomposed into three subspaces: sparse, low rank, and the remainder noise components. Then, we calculate the components as a segregation problem. In the unvoiced frames, we combine the straightforward multivariate generalization of the wavelet denoising technique with the principal component analysis method. Experiments on NOIZEUS and NTT databases show that the proposed approach obtains satisfying results for most types of noise with little speech degradation and outperforms several competitive methods.
引用
收藏
页码:1912 / 1933
页数:21
相关论文
共 58 条
  • [1] Abramson A(2007)Simultaneous detection and estimation approach for speech enhancement IEEE Trans. Speech Audio Process. 15 2348-2359
  • [2] Cohen I(2006)Wavelet speech enhancement based on time-scale adaptation Speech Commun. 48 1620-1637
  • [3] Bahoura M(2015)Speech enhancement based on wavelet transform and improved subspace decomposition J. Audio Eng. Soc. 63 990-1000
  • [4] Rouat J(2016)Inverse filtering and principal component analysis techniques for speech dereverberation Adv. Electr. Electron. Eng. 14 129-138
  • [5] Ben Messaoud MA(1979)Suppression of acoustic noise in speech using spectral subtraction IEEE Trans. Acoust. Speech. Signal Process. 27 113-120
  • [6] Bouzid A(2008)A sawtooth waveform inspired pitch estimator for speech and music J. Acoust. Soc. Am. 124 1638-1652
  • [7] Ben Messaoud MA(2011)Robust principal component analysis? J. ACM. 58 1-37
  • [8] Bouzid A(2015)Speech enhancement based on Bayesian decision and spectral amplitude estimation EURASIP J. Audio Speech Music Process. 28 1-18
  • [9] Boll SF(1995)Adapting to unknown smoothness via wavelet shrinkage J. Am Stat. Assoc. 90 1200-1224
  • [10] Camacho A(2007)Minimum mean-square error estimation of discrete Fourier coefficients with generalized gamma priors IEEE Trans. Audio Speech Lang Process. 15 1741-1752