Laplacian Speech Model and Soft Decision Based MMSE Estimator for Noise Power Spectral Density in Speech Enhancement

Cited by: 3

Authors:

Ou Shifeng [1]

Song Peng [2]

Gao Ying [1]

Affiliations:

[1] Yantai Univ, Sch Sci & Technol Optoelect Informat, Yantai 264005, Peoples R China

[2] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Peoples R China

Source:

CHINESE JOURNAL OF ELECTRONICS | 2018 / Vol. 27 / No. 06

Funding:

National Natural Science Foundation of China;

Keywords:

Noise PSD estimation; Speech enhancement; Laplacian speech model; Soft decision; LOW-COMPLEXITY; ALGORITHM;

DOI:

10.1049/cje.2018.09.009

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Estimating the noise power spectral density (PSD) is crucial for speech enhancement because it significantly affects the quality and intelligibility of the enhanced speech. Most existing noise PSD estimators employ Gaussian speech priors, which, however, have been shown to be inconsistent with real speech. We derive an effective solution to the problem of estimating the noise PSD in the minimum mean square error (MMSE) sense when the speech component is modeled by a Laplacian distribution. In addition, a soft decision technique, instead of hard voice activity detection (VAD), is incorporated into our algorithm, which automatically makes the estimate unbiased without requiring bias compensation. The performance of the proposed method is evaluated with several objective and subjective measures under various stationary and nonstationary noise environments. The results confirm that our method achieves good performance for all the noise conditions and signal-to-noise ratio (SNR) settings.


Pages: 1214-1220

Page count: 7
