The Effect of Spectral Estimation on Speech Enhancement Performance

被引:6
|
作者
Charoenruengkit, Werayuth [1 ,2 ]
Erdoel, Nurguen [1 ]
机构
[1] Florida Atlantic Univ, Dept Elect Engn, Boca Raton, FL 33431 USA
[2] IBM Corp, Boca Raton, FL 33487 USA
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 05期
关键词
Spectral estimation; speech communication; speech enhancement; speech processing; NOISE; SUPPRESSION; REDUCTION;
D O I
10.1109/TASL.2010.2087750
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
It has long been observed that accuracy in spectral estimation greatly affects the quality of enhanced speech. A small decrease in the bias and variance of the estimator can greatly reduce the amount of residual noise and distortion in the recovered speech. To date, however, there has been little interest in a rigorous analysis quantifying such observations. In this paper, we analyze the effect of spectral estimate variance on enhanced speech as measured by quantitative and qualitative means. The performance analysis is derived for the signal subspace and the minimum mean square error short-time spectral amplitude estimators. Error is defined as the random function of frequency given by the difference between the estimated and the true power spectral density (PSD) functions. It is measured by its variance as a fraction of the clean speech PSD squared: a norm called the variance quality factor (VQF). The error VQF is derived in terms of the VQF of measurable quantities such as noisy speech and noise alone. It is shown that reducing the PSD estimate variance reduces significantly the VQF of the enhancement error. We provide analytical derivations to establish the results and accompanying simulations to confirm the theoretical analysis. Simulations test the periodogram, Blackman-Tukey, Bartlett-Welch, and Multitaper spectral estimation methods.
引用
收藏
页码:1170 / 1179
页数:10
相关论文
共 50 条
  • [1] A Speech Enhancement Method by Coupling Speech Detection and Spectral Amplitude Estimation
    Deng, Feng
    Bao, Chang-Chun
    Bao, Feng
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3233 - 3237
  • [2] β-order MMSE spectral amplitude estimation for speech enhancement
    You, CH
    Koh, SN
    Rahardja, S
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (04): : 475 - 486
  • [3] Speech Enhancement Using a Risk Estimation Approach
    Sadasivan, Jishnu
    Seelamantula, Chandra Sekhar
    Muraka, Nagarjuna Reddy
    SPEECH COMMUNICATION, 2020, 116 : 12 - 29
  • [4] Speech enhancement based on Bayesian decision and spectral amplitude estimation
    Deng, Feng
    Bao, Chang-Chun
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2015,
  • [5] Generalized maximum a posteriori spectral amplitude estimation for speech enhancement
    Tsao, Yu
    Lai, Ying-Hui
    SPEECH COMMUNICATION, 2016, 76 : 112 - 126
  • [6] Spectral Phase Estimation Based on Deep Neural Networks for Single Channel Speech Enhancement
    Saleem, N.
    Khattak, M. I.
    Perez, E. V.
    JOURNAL OF COMMUNICATIONS TECHNOLOGY AND ELECTRONICS, 2019, 64 (12) : 1372 - 1382
  • [7] Speech enhancement by spectral magnitude estimation - A unifying approach
    Xie, F
    VanCompernolle, D
    SPEECH COMMUNICATION, 1996, 19 (02) : 89 - 104
  • [8] Analysis of Optimized Spectral Subtraction Method for Single Channel Speech Enhancement
    Gupta, Monika
    Singh, R. K.
    Singh, Sachin
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 128 (03) : 2203 - 2215
  • [9] BAYESIAN SPECTRAL AMPLITUDE ESTIMATION FOR SPEECH ENHANCEMENT WITH CORRELATED SPECTRAL COMPONENTS
    Plourde, Eric
    Champagne, Benoit
    2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 397 - 400
  • [10] Speech enhancement based on β-order MMSE estimation of Short Time Spectral Amplitude and Laplacian speech modeling
    Abutalebi, Hamid Reza
    Rashidinejad, Mehdi
    SPEECH COMMUNICATION, 2015, 67 : 92 - 101