SPECTRAL MAGNITUDE MINIMUM MEAN-SQUARE ERROR BINARY MASKS FOR DFT BASED SPEECH ENHANCEMENT

Cited by: 0
Authors
Jensen, Jesper
Hendriks, Richard C.
Source
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011
Keywords
Speech enhancement; binary masks; minimum mean-square error; intelligibility
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Originally, ideal binary mask (idbm) techniques have been used as a tool for studying aspects of the auditory system. More recently, idbm techniques have been adapted to the practical problem of retrieving a target speech signal from a noisy observation. In this practical setting, the binary mask techniques show similarities with existing DFT based speech enhancement techniques. In this context, we derive single-channel, binary mask estimators which minimize the spectral magnitude mean-square error. We show in simulation experiments with natural speech and noise signals that the proposed estimators perform significantly better than existing binary mask estimators. However, even the best of the proposed estimators is clearly outperformed by non-binary estimators, both in terms of speech quality and intelligibility.
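The abstract contrasts ideal binary masks with binary masks chosen to minimize a spectral magnitude error. The sketch below illustrates only that contrast, in an oracle setting where the clean speech magnitudes are known. It is not the estimator derived in the paper, which is not reproduced in this record. The function names, the 0 dB threshold, and the toy magnitudes are all assumptions made for illustration.

```python
import math


def ideal_binary_mask(speech_mag, noise_mag, threshold_db=0.0):
    """Oracle ideal binary mask: 1 where the local SNR exceeds the threshold.

    threshold_db is a hypothetical parameter chosen for this sketch.
    """
    mask = []
    for s, n in zip(speech_mag, noise_mag):
        if s == 0.0:
            mask.append(0)  # no speech energy in this bin
        elif n == 0.0:
            mask.append(1)  # speech only, no noise
        else:
            snr_db = 20.0 * math.log10(s / n)
            mask.append(1 if snr_db > threshold_db else 0)
    return mask


def oracle_magnitude_mask(speech_mag, noisy_mag):
    """Per bin, pick the gain in {0, 1} with the smaller squared magnitude error.

    Gain 1 leaves the error (|Y| - |S|)^2, gain 0 leaves |S|^2.
    Oracle illustration only: it needs the clean magnitude |S|.
    """
    return [1 if (y - s) ** 2 < s ** 2 else 0
            for s, y in zip(speech_mag, noisy_mag)]


def apply_mask(mask, noisy_mag):
    """Apply a binary gain to the noisy DFT magnitudes."""
    return [g * y for g, y in zip(mask, noisy_mag)]


# Toy magnitudes for three DFT bins (assumed values, not data from the paper).
speech = [1.0, 0.1, 0.5]
noise = [0.2, 1.0, 0.5]
noisy = [1.1, 0.9, 0.6]

ibm = ideal_binary_mask(speech, noise)
mag_mask = oracle_magnitude_mask(speech, noisy)
enhanced = apply_mask(mag_mask, noisy)
```

In the third bin the local SNR is exactly 0 dB, so the thresholded mask discards it, while the magnitude-error criterion keeps it because passing the noisy magnitude leaves a smaller error than zeroing the bin.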
Pages: 4736-4739
Page count: 4