Laplacian Speech Model and Soft Decision Based MMSE Estimator for Noise Power Spectral Density in Speech Enhancement

Cited by: 0

Authors:

OU Shifeng [1]

SONG Peng [2]

GAO Ying [1]

Affiliations:

[1] School of Science and Technology for Opto-electronic Information, Yantai University

[2] School of Computer and Control Engineering, Yantai University

Source:

Chinese Journal of Electronics | 2018, Vol. 27, No. 06

Funding:

National Natural Science Foundation of China;

Keywords:

Noise PSD estimation; Speech enhancement; Laplacian speech model; Soft decision;

DOI:

Not available

Chinese Library Classification (CLC) number:

TN912.35 [Speech enhancement];

Discipline classification code:

0711;

Abstract:

Estimating the noise power spectral density (PSD) is a crucial issue in speech enhancement because it significantly affects the quality and intelligibility of the enhanced speech. Most existing noise PSD estimators employ Gaussian speech priors, which, however, have been shown to be inconsistent with real speech. We derive a solution to the problem of estimating the noise PSD in the minimum mean square error (MMSE) sense when the speech component is modeled by a Laplacian distribution. In addition, a soft decision technique is incorporated into the algorithm in place of hard voice activity detection (VAD), which automatically makes the estimate unbiased without requiring bias compensation. The performance of the proposed method is tested with several objective and subjective measures under various stationary and nonstationary noise environments. The results confirm that the method achieves good performance across all tested noise conditions and signal-to-noise ratio (SNR) settings.
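To illustrate what a soft decision means in this context, the following is a minimal Python sketch of a single-bin noise PSD update that weights the observation by a speech presence probability instead of gating it with a hard VAD flag. It is not the Laplacian estimator of the paper, whose equations the abstract does not give. It assumes a Gaussian speech model, and the function names, the fixed a priori SNR of 15 dB, the prior of 0.5, and the smoothing factor of 0.8 are all illustrative assumptions.

```python
import math


def speech_presence_prob(periodogram, noise_psd, xi_h1=10 ** 1.5, prior_h1=0.5):
    """Posterior probability that speech is present in one frequency bin.

    Assumption: Gaussian speech and noise coefficients (NOT the Laplacian
    model of the paper). xi_h1 is a fixed a priori SNR (15 dB here) and
    prior_h1 is the prior speech presence probability; both are
    illustrative values.
    """
    prior_ratio = (1.0 - prior_h1) / prior_h1
    # The exponent is never positive, so math.exp cannot overflow.
    exponent = -(periodogram / noise_psd) * xi_h1 / (1.0 + xi_h1)
    return 1.0 / (1.0 + prior_ratio * (1.0 + xi_h1) * math.exp(exponent))


def update_noise_psd(periodogram, noise_psd, alpha=0.8):
    """One recursive noise PSD update using a soft decision.

    A hard VAD would either accept or reject the periodogram. Here the
    periodogram and the previous noise estimate are blended according to
    the speech presence probability. alpha is an assumed smoothing factor.
    """
    p_h1 = speech_presence_prob(periodogram, noise_psd)
    # Speech absent: trust the observation. Speech present: keep the old estimate.
    noise_periodogram = (1.0 - p_h1) * periodogram + p_h1 * noise_psd
    return alpha * noise_psd + (1.0 - alpha) * noise_periodogram


if __name__ == "__main__":
    estimate = 1.0
    # Hypothetical periodogram values for one bin: noise only, then a speech burst.
    for y2 in [0.9, 1.1, 50.0, 80.0, 1.0]:
        estimate = update_noise_psd(y2, estimate)
        print(round(estimate, 4))
```

In this sketch a strong speech burst pushes the presence probability toward one, so the estimate is held nearly constant, while noise-like frames are tracked smoothly. Swapping in a Laplacian speech prior would change how the presence probability and the conditional noise estimate are computed, which is the part the paper derives.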

Citation

Pages: 1214-1220

Page count: 7
