The Effect of Spectral Estimation on Speech Enhancement Performance

Cited by: 6

Authors:

Charoenruengkit, Werayuth [1,2]

Erdoel, Nurguen [1]

Affiliations:

[1] Florida Atlantic Univ, Dept Elect Engn, Boca Raton, FL 33431 USA

[2] IBM Corp, Boca Raton, FL 33487 USA

Source:

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011 / Vol. 19 / Issue 05

Keywords:

Spectral estimation; speech communication; speech enhancement; speech processing; NOISE; SUPPRESSION; REDUCTION;

DOI:

10.1109/TASL.2010.2087750

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Subject classification code:

070206 ; 082403 ;

Abstract:

It has long been observed that accuracy in spectral estimation greatly affects the quality of enhanced speech. A small decrease in the bias and variance of the estimator can greatly reduce the amount of residual noise and distortion in the recovered speech. To date, however, there has been little interest in a rigorous analysis quantifying such observations. In this paper, we analyze the effect of spectral estimate variance on enhanced speech as measured by quantitative and qualitative means. The performance analysis is derived for the signal subspace and the minimum mean square error short-time spectral amplitude estimators. Error is defined as the random function of frequency given by the difference between the estimated and the true power spectral density (PSD) functions. It is measured by its variance as a fraction of the clean speech PSD squared: a norm called the variance quality factor (VQF). The error VQF is derived in terms of the VQF of measurable quantities such as noisy speech and noise alone. It is shown that reducing the PSD estimate variance reduces significantly the VQF of the enhancement error. We provide analytical derivations to establish the results and accompanying simulations to confirm the theoretical analysis. Simulations test the periodogram, Blackman-Tukey, Bartlett-Welch, and Multitaper spectral estimation methods.
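
The abstract's central claim, that a lower-variance PSD estimator gives a smaller normalized error, can be illustrated with a toy experiment. The sketch below is not the paper's derivation: it uses white Gaussian noise (whose true PSD is flat and known) instead of speech, compares the raw periodogram with a plain Bartlett average of non-overlapping segments, and normalizes the estimator variance by the true PSD squared, loosely in the spirit of the VQF described above. The function names, the segment count, and the trial count are all assumptions made for illustration.

```python
import cmath
import random
import statistics


def periodogram(x):
    """Periodogram |DFT(x)|^2 / N at bins 1 .. N/2 - 1 (DC and Nyquist skipped)."""
    n = len(x)
    return [
        abs(sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))) ** 2 / n
        for k in range(1, n // 2)
    ]


def bartlett(x, num_segments):
    """Average the periodograms of non-overlapping, equal-length segments."""
    seg_len = len(x) // num_segments
    spectra = [periodogram(x[i * seg_len:(i + 1) * seg_len]) for i in range(num_segments)]
    return [sum(column) / num_segments for column in zip(*spectra)]


def normalized_variance(estimator, true_psd, n, trials, rng):
    """Per-bin variance of the estimate across trials, divided by true_psd^2,
    then averaged over bins. This is an illustrative stand-in, not the paper's VQF."""
    estimates = []
    for _ in range(trials):
        # White Gaussian noise with variance true_psd has a flat PSD equal to true_psd.
        x = [rng.gauss(0.0, true_psd ** 0.5) for _ in range(n)]
        estimates.append(estimator(x))
    per_bin = [statistics.variance(column) / true_psd ** 2 for column in zip(*estimates)]
    return sum(per_bin) / len(per_bin)


def compare_estimators(seed=0, n=64, trials=200, num_segments=4, true_psd=1.0):
    """Return (periodogram, Bartlett) normalized variances for white noise."""
    rng = random.Random(seed)
    p = normalized_variance(periodogram, true_psd, n, trials, rng)
    b = normalized_variance(lambda x: bartlett(x, num_segments), true_psd, n, trials, rng)
    return p, b


if __name__ == "__main__":
    p, b = compare_estimators()
    print(f"periodogram normalized variance: {p:.3f}")
    print(f"Bartlett    normalized variance: {b:.3f}")
```

For white noise, the periodogram's normalized variance should come out near 1 and the Bartlett estimate near 1 divided by the number of segments, at the cost of coarser frequency resolution. The Blackman-Tukey and multitaper estimators named in the abstract trade variance against resolution in different ways and are not covered by this sketch.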


Pages: 1170-1179

Page count: 10
