Postfilter for Dual Channel Speech Enhancement Using Coherence and Statistical Model-Based Noise Estimation

被引：1

作者：

Cheong, Sein ^{[1
]}

Kim, Minseung ^{[1
]}

Shin, Jong Won ^{[1
]}

机构：

[1] Gwangju Inst Sci & Technol, Sch Elect Engn & Comp Sci, Gwangju 61005, South Korea

来源：

SENSORS | 2024年 / 24卷 / 12期

关键词：

noise PSD estimation; coherence; dual channel speech enhancement; postfilter; speech presence probability estimation;

D O I：

10.3390/s24123979

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

A multichannel speech enhancement system usually consists of spatial filters such as adaptive beamformers followed by postfilters, which suppress remaining noise. Accurate estimation of the power spectral density (PSD) of the residual noise is crucial for successful noise reduction in the postfilters. In this paper, we propose a postfilter utilizing proposed a posteriori speech presence probability (SPP) and noise PSD estimators, which are based on both the coherence and the statistical models. We model the coherence-based a posteriori SPP as a simple function of the magnitude of coherence between two microphone signals and combine it with a single-channel SPP based on statistical models. The coherence-based estimator for the PSD of the noise remaining in the beamformer output in the presence of speech is derived using the pseudo-coherence considering the effect of the beamformers, which is used to construct the coherence-based noise PSD estimator. Then, the final noise PSD estimator is obtained by combining the coherence-based and statistical model-based noise PSD estimators with the proposed SPP. The spectral gain function is also modified, incorporating the proposed SPP. Experimental results demonstrate that the proposed method led to more accurate noise PSD estimation and perceptual evaluation of speech quality scores in various diffuse noise environments, and did not degrade the speech quality under the presence of directional interference, although the proposed method utilizes the coherence information.

引用

页数：17

共 49 条

[1] [Anonymous], 2007, Wideband Extension to Recommendation P.862 for the Assessment of Wideband Telephone Networks and Speech Codec
[2] [Anonymous], 2014, ETSI TS 126 132
[3] [Anonymous], 2008, ETSI ES 202 396-1
[4] Benesty J, 2008, SPRINGER TOP SIGN PR, V1, P1
[5] Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech
Cauchi, Benjamin
Kodrasi, Ina
Rehr, Robert
Gerlach, Stephan
Jukic, Ante
Gerkmann, Timo
Doclo, Simon
Goetze, Stefan
[J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015,
[6] Speech Enhancement Based on Beamforming and Post-Filtering by Combining Phase Information
Cheng, Rui
Bao, Changchun
[J]. INTERSPEECH 2020, 2020, : 4496 - 4500
[7] Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
Cohen, I
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 466 - 475
[8] Noise estimation by minima controlled recursive averaging for robust speech enhancement
Cohen, I
Berdugo, B
[J]. IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) : 12 - 15
[9] Speech enhancement for non-stationary noise environments
Cohen, I
Berdugo, B
[J]. SIGNAL PROCESSING, 2001, 81 (11) : 2403 - 2418
[10] SPATIAL-CORRELATION FUNCTIONS FOR VARIOUS NOISE MODELS
CRON, BF
SHERMAN, CH
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1962, 34 (11) : 1732 - &

← 1 2 3 4 5 →