Postfilter for Dual Channel Speech Enhancement Using Coherence and Statistical Model-Based Noise Estimation

Cited by: 1
Authors
Cheong, Sein [1]
Kim, Minseung [1]
Shin, Jong Won [1]
Affiliations
[1] Gwangju Inst Sci & Technol, Sch Elect Engn & Comp Sci, Gwangju 61005, South Korea
Keywords
noise PSD estimation; coherence; dual channel speech enhancement; postfilter; speech presence probability estimation
DOI
10.3390/s24123979
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Subject classification codes
070302; 081704
Abstract
A multichannel speech enhancement system usually consists of spatial filters, such as adaptive beamformers, followed by postfilters that suppress the remaining noise. Accurate estimation of the power spectral density (PSD) of the residual noise is crucial for successful noise reduction in the postfilters. In this paper, we propose a postfilter that uses new a posteriori speech presence probability (SPP) and noise PSD estimators, both of which are based on the coherence and on statistical models. We model the coherence-based a posteriori SPP as a simple function of the magnitude of the coherence between the two microphone signals and combine it with a single-channel SPP based on statistical models. The coherence-based estimator for the PSD of the noise remaining in the beamformer output in the presence of speech is derived using the pseudo-coherence, taking the effect of the beamformers into account; this estimator is then used to construct the coherence-based noise PSD estimator. The final noise PSD estimator is obtained by combining the coherence-based and statistical model-based noise PSD estimators using the proposed SPP. The spectral gain function is also modified to incorporate the proposed SPP. Experimental results demonstrate that the proposed method yields more accurate noise PSD estimates and higher perceptual evaluation of speech quality (PESQ) scores in various diffuse noise environments, and that it does not degrade speech quality in the presence of directional interference, even though it relies on coherence information.
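The abstract describes the coherence-based SPP only as "a simple function of the magnitude of coherence between two microphone signals" and does not give its form. As a rough illustration of the idea, and not the authors' method, the sketch below estimates the magnitude coherence from two STFT signals by recursive averaging and maps it to a value in [0, 1]. The function names, the smoothing factor `alpha`, the clipped-linear mapping, and the thresholds `lo` and `hi` are all assumptions made for this example.

```python
import numpy as np


def magnitude_coherence(X1, X2, alpha=0.9, eps=1e-12):
    """Magnitude coherence per frame and frequency bin.

    X1, X2: complex STFT arrays of shape (frames, bins), one per microphone.
    alpha:  recursive smoothing factor for the PSD estimates (assumed value).
    """
    n_frames, n_bins = X1.shape
    p11 = np.zeros(n_bins)                 # auto-PSD of microphone 1
    p22 = np.zeros(n_bins)                 # auto-PSD of microphone 2
    p12 = np.zeros(n_bins, dtype=complex)  # cross-PSD
    coh = np.zeros((n_frames, n_bins))
    for t in range(n_frames):
        # First-order recursive averaging of auto- and cross-periodograms.
        p11 = alpha * p11 + (1 - alpha) * np.abs(X1[t]) ** 2
        p22 = alpha * p22 + (1 - alpha) * np.abs(X2[t]) ** 2
        p12 = alpha * p12 + (1 - alpha) * X1[t] * np.conj(X2[t])
        coh[t] = np.abs(p12) / np.sqrt(p11 * p22 + eps)
    return coh


def coherence_to_spp(coh, lo=0.3, hi=0.9):
    """Hypothetical mapping: clipped linear ramp from coherence to an SPP-like value."""
    return np.clip((coh - lo) / (hi - lo), 0.0, 1.0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    shape = (200, 8)
    source = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
    noise = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
    # Same source at both microphones (phase shift only): coherence close to 1.
    directional = magnitude_coherence(source, source * np.exp(1j * 0.5))
    # Independent signals at the two microphones: coherence stays low.
    incoherent = magnitude_coherence(source, noise)
    print(directional[50:].mean(), incoherent[50:].mean())
```

With very few frames averaged the estimate is biased toward 1 (it equals 1 on the first frame), so in practice the early frames would need separate handling. The intuition is that a directional source produces high coherence between the microphones while diffuse noise does not, which is why coherence magnitude can serve as evidence of speech presence.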
Pages: 17