Laplacian Speech Model and Soft Decision Based MMSE Estimator for Noise Power Spectral Density in Speech Enhancement

被引:3
作者
Ou Shifeng [1 ]
Song Peng [2 ]
Gao Ying [1 ]
机构
[1] Yantai Univ, Sch Sci & Technol Optoelect Informat, Yantai 264005, Peoples R China
[2] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Peoples R China
基金
中国国家自然科学基金;
关键词
Noise PSD estimation; Speech enhancement; Laplacian speech model; Soft decision; LOW-COMPLEXITY; ALGORITHM;
D O I
10.1049/cje.2018.09.009
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The estimation of noise Power spectral density (PSD) is a very crucial issue for speech enhancement as a result of its significant effect on the quality and intelligibility of the enhanced speech. Most of the existing estimators for noise PSD try to employ Gaussian speech priors, which, however, have been proven inconsistent with the reality. We derived an effective solution to this problem of estimating noise PSD in the Minimum mean square error (MMSE) sense when the speech component is modeled by a Laplacian distribution. Meanwhile, the soft decision technique instead of the hard Voice activity detection (VAD) is evolved into our algorithm, which can automatically makes the estimation unbiased without requiring a bias compensation. The performance of the proposed method is tested by several objective and subjective measures under various stationary and nonstationary noise environments. The results confirm that our method achieves good performance for all the noise conditions and Signal noise-ratio (SNR) settings.
引用
收藏
页码:1214 / 1220
页数:7
相关论文
共 22 条
  • [1] [Anonymous], 2013, COMPUT REV
  • [2] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION
    BOLL, SF
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02): : 113 - 120
  • [3] Voice activity detection based on multiple statistical models
    Chang, Joon-Hyuk
    Kim, Nam Soo
    Mitra, Sanjit K.
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (06) : 1965 - 1976
  • [4] Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
    Cohen, I
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 466 - 475
  • [5] Noise estimation by minima controlled recursive averaging for robust speech enhancement
    Cohen, I
    Berdugo, B
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) : 12 - 15
  • [6] Fan NP, 2007, INT CONF ACOUST SPEE, P581
  • [7] Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay
    Gerkmann, Timo
    Hendriks, Richard C.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04): : 1383 - 1393
  • [8] Noise tracking using DFT domain subspace decompositions
    Hendriks, Richard C.
    Jensen, Jesper
    Heusdens, Richard
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03): : 541 - 553
  • [9] MMSE BASED NOISE PSD TRACKING WITH LOW COMPLEXITY
    Hendriks, Richard C.
    Heusdens, Richard
    Jensen, Jesper
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4266 - 4269
  • [10] A cepstrum-based preprocessing and postprocessing for speech enhancement in adverse environments
    Hu, Xiaohu
    Wang, Shiwei
    Zheng, Chengshi
    Li, Xiaodong
    [J]. APPLIED ACOUSTICS, 2013, 74 (12) : 1458 - 1462