The Effect of Spectral Estimation on Speech Enhancement Performance

被引：6

作者：

Charoenruengkit, Werayuth ^{[1
,2
]}

Erdoel, Nurguen ^{[1
]}

机构：

[1] Florida Atlantic Univ, Dept Elect Engn, Boca Raton, FL 33431 USA

[2] IBM Corp, Boca Raton, FL 33487 USA

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 05期

关键词：

Spectral estimation; speech communication; speech enhancement; speech processing; NOISE; SUPPRESSION; REDUCTION;

D O I：

10.1109/TASL.2010.2087750

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

It has long been observed that accuracy in spectral estimation greatly affects the quality of enhanced speech. A small decrease in the bias and variance of the estimator can greatly reduce the amount of residual noise and distortion in the recovered speech. To date, however, there has been little interest in a rigorous analysis quantifying such observations. In this paper, we analyze the effect of spectral estimate variance on enhanced speech as measured by quantitative and qualitative means. The performance analysis is derived for the signal subspace and the minimum mean square error short-time spectral amplitude estimators. Error is defined as the random function of frequency given by the difference between the estimated and the true power spectral density (PSD) functions. It is measured by its variance as a fraction of the clean speech PSD squared: a norm called the variance quality factor (VQF). The error VQF is derived in terms of the VQF of measurable quantities such as noisy speech and noise alone. It is shown that reducing the PSD estimate variance reduces significantly the VQF of the enhancement error. We provide analytical derivations to establish the results and accompanying simulations to confirm the theoretical analysis. Simulations test the periodogram, Blackman-Tukey, Bartlett-Welch, and Multitaper spectral estimation methods.

引用

页码：1170 / 1179

页数：10

共 50 条

[31] An Iterative Graph Spectral Subtraction Method for Speech Enhancement
Yan, Xue
Yang, Zhen
Wang, Tingting
Guo, Haiyan
SPEECH COMMUNICATION, 2020, 123 : 35 - 42
[32] Masking Estimation with Phase Restoration of Clean Speech for Monaural Speech Enhancement
Wang, Xianyun
Bao, Changchun
INTERSPEECH 2019, 2019, : 3188 - 3192
[33] Beta-order minimum mean-square error multichannel spectral amplitude estimation for speech enhancement
Trawicki, M. B.
Johnson, M. T.
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2015, 29 (10) : 1287 - 1295
[34] Spectral-domain speech enhancement for speech recognition
You, Chang Huai
Ma, Bin
SPEECH COMMUNICATION, 2017, 94 : 30 - 41
[35] Enhancement of Spectral Tilt in Synthesized Speech
Sharma, Bidisha
Prasanna, S. R. Mahadeva
IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (04) : 382 - 386
[36] Speech enhancement by spectral component selection
Wei, W
Chen, YP
2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 674 - 678
[37] Spectral Phase Estimation Based on Deep Neural Networks for Single Channel Speech Enhancement
N. Saleem
M. I. Khattak
E. V. Perez
Journal of Communications Technology and Electronics, 2019, 64 : 1372 - 1382
[38] Speech enhancement with adaptive spectral estimators
Y. Sandoval-Ibarra
V. H. Diaz-Ramirez
V. I. Kober
V. N. Karnaukhov
Journal of Communications Technology and Electronics, 2016, 61 : 672 - 678
[39] Speech enhancement based on soft audible noise masking and noise power estimation
Yu, Rongshan
SPEECH COMMUNICATION, 2013, 55 (10) : 964 - 974
[40] Speech Enhancement By Minimum Mean-Square Error Spectral Amplitude Estimation Assuming Weibull Speech Priors
Bahrami, Mojtaba
Faraji, Neda
2017 19TH CSI INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2017, : 190 - 194

← 1 2 3 4 5 →