Minimum mean square error estimator for speech enhancement in additive noise assuming Weibull speech priors and speech presence uncertainty

被引：4

作者：

Bahrami, Mojtaba ^{[1
]}

Faraji, Neda ^{[1
]}

机构：

[1] Imam Khomeini Int Univ, Dept Elect Engn, Qazvin, Iran

来源：

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY | 2021年 / 24卷 / 01期

关键词：

Speech enhancement; Weibull distribution; Speech presence uncertainty; Minimum mean square error estimation; A-POSTERIORI; AMPLITUDE;

D O I：

10.1007/s10772-020-09767-y

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

A novel single-channel technique was proposed based on a minimum mean square error (MMSE) estimator to enhance short-time spectral amplitude (STSA) in the Discrete Fourier Transform (DFT) domain. In the present contribution, a Weibull distribution was used to model DFT magnitudes of clean speech signals under the additive Gaussian noise assumption. Moreover, the speech enhancement procedure was conducted with (WSPU) and without speech presence uncertainty (WoSPU). The theoretical spectral gain function was obtained as a weighted geometric mean of hypothetical gains associated with signal presence and absence. Extensive experiments were conducted with clean speech signals taken from the TIMIT database, which had been degraded by various additive non-stationary noise sources, and then enhanced signals were evaluated. The evaluation results demonstrated the outperformance of the proposed method compared to the probability density functions (PDF) of Rayleigh and Gamma distributions in terms of segmental signal-to-noise ratio (segSNR), general SNR, and perceptual evaluation of speech quality (PESQ). The performance in the WSPU case was also significantly improved compared to WoSPU, assuming Weibull speech priors in the MMSE-STSA based speech enhancement algorithm.

引用

页码：97 / 108

页数：12

共 41 条

[1] Speech enhancement with an adaptive Wiener filter [J].

Abd El-Fattah, Marwa ;

Dessouky, Moawad ;

Abbas, Alaa ;

Diab, Salaheldin ;

El-Rabaie, El-Sayed ;

Al-Nuaimy, Waleed ;

Alshebeili, Saleh ;

Abd El-Samie, Fathi .

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (01) :53-64

[2]

Andrianakis I, 2006, INT CONF ACOUST SPEE, P3519

[3]

[Anonymous], 1988, OBJECTIVE MEASURES S

[4]

[Anonymous], 2009, INTERSPEECH

[5]

[Anonymous], 2010, INT WORKSH AC ECH NO

[6]

Bahrami M, 2018, IRAN CONF ELECTR ENG, P749, DOI 10.1109/ICEE.2018.8472626

[7]

Bahrami M, 2017, 2017 19TH CSI INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), P190, DOI 10.1109/AISP.2017.8324079

[8]

Brillinger D. R., 2001, Time Series: Data Analysis and Theory, DOI DOI 10.1137/1.9780898719246

[9] Speech enhancement using Maximum A-Posteriori and Gaussian Mixture Models for speech and noise Periodogram estimation [J].

Chehrehsa, Sarang ;

Moir, Tom James .

COMPUTER SPEECH AND LANGUAGE, 2016, 36 :58-71

[10] A Laplacian-based MMSE estimator for speech enhancement [J].

Chen, Bin ;

Loizou, Philipos C. .

SPEECH COMMUNICATION, 2007, 49 (02) :134-143

← 1 2 3 4 5 →