The Effect of Spectral Estimation on Speech Enhancement Performance

被引:6
|
作者
Charoenruengkit, Werayuth [1 ,2 ]
Erdoel, Nurguen [1 ]
机构
[1] Florida Atlantic Univ, Dept Elect Engn, Boca Raton, FL 33431 USA
[2] IBM Corp, Boca Raton, FL 33487 USA
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 05期
关键词
Spectral estimation; speech communication; speech enhancement; speech processing; NOISE; SUPPRESSION; REDUCTION;
D O I
10.1109/TASL.2010.2087750
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
It has long been observed that accuracy in spectral estimation greatly affects the quality of enhanced speech. A small decrease in the bias and variance of the estimator can greatly reduce the amount of residual noise and distortion in the recovered speech. To date, however, there has been little interest in a rigorous analysis quantifying such observations. In this paper, we analyze the effect of spectral estimate variance on enhanced speech as measured by quantitative and qualitative means. The performance analysis is derived for the signal subspace and the minimum mean square error short-time spectral amplitude estimators. Error is defined as the random function of frequency given by the difference between the estimated and the true power spectral density (PSD) functions. It is measured by its variance as a fraction of the clean speech PSD squared: a norm called the variance quality factor (VQF). The error VQF is derived in terms of the VQF of measurable quantities such as noisy speech and noise alone. It is shown that reducing the PSD estimate variance reduces significantly the VQF of the enhancement error. We provide analytical derivations to establish the results and accompanying simulations to confirm the theoretical analysis. Simulations test the periodogram, Blackman-Tukey, Bartlett-Welch, and Multitaper spectral estimation methods.
引用
收藏
页码:1170 / 1179
页数:10
相关论文
共 50 条
  • [31] An Iterative Graph Spectral Subtraction Method for Speech Enhancement
    Yan, Xue
    Yang, Zhen
    Wang, Tingting
    Guo, Haiyan
    SPEECH COMMUNICATION, 2020, 123 : 35 - 42
  • [32] Masking Estimation with Phase Restoration of Clean Speech for Monaural Speech Enhancement
    Wang, Xianyun
    Bao, Changchun
    INTERSPEECH 2019, 2019, : 3188 - 3192
  • [33] Beta-order minimum mean-square error multichannel spectral amplitude estimation for speech enhancement
    Trawicki, M. B.
    Johnson, M. T.
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2015, 29 (10) : 1287 - 1295
  • [34] Spectral-domain speech enhancement for speech recognition
    You, Chang Huai
    Ma, Bin
    SPEECH COMMUNICATION, 2017, 94 : 30 - 41
  • [35] Enhancement of Spectral Tilt in Synthesized Speech
    Sharma, Bidisha
    Prasanna, S. R. Mahadeva
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (04) : 382 - 386
  • [36] Speech enhancement by spectral component selection
    Wei, W
    Chen, YP
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 674 - 678
  • [37] Spectral Phase Estimation Based on Deep Neural Networks for Single Channel Speech Enhancement
    N. Saleem
    M. I. Khattak
    E. V. Perez
    Journal of Communications Technology and Electronics, 2019, 64 : 1372 - 1382
  • [38] Speech enhancement with adaptive spectral estimators
    Y. Sandoval-Ibarra
    V. H. Diaz-Ramirez
    V. I. Kober
    V. N. Karnaukhov
    Journal of Communications Technology and Electronics, 2016, 61 : 672 - 678
  • [39] Speech enhancement based on soft audible noise masking and noise power estimation
    Yu, Rongshan
    SPEECH COMMUNICATION, 2013, 55 (10) : 964 - 974
  • [40] Speech Enhancement By Minimum Mean-Square Error Spectral Amplitude Estimation Assuming Weibull Speech Priors
    Bahrami, Mojtaba
    Faraji, Neda
    2017 19TH CSI INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2017, : 190 - 194