Joint Detection and Estimation of Speech Spectral Amplitude Using Noncontinuous Gain Functions

被引:17
作者
Momeni, Hajar [1 ]
Abutalebi, Hamid Reza [1 ]
Tadaion, Aliakbar [1 ]
机构
[1] Yazd Univ, Dept Elect & Comp Engn, Yazd 89195741, Iran
关键词
Joint detection and estimation; spectral amplitude estimation; speech detection; speech enhancement;
D O I
10.1109/TASLP.2015.2427522
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the joint detection and estimation approach for single-channel speech enhancement. In this approach, a detector decides on speech presence in each time-frequency unit and an estimator estimates the corresponding speech spectral amplitude. We utilize the concept of binary/continuous gain functions to study and extend the process of joint detection and estimation. The binary gains (BGs) have already shown an inferior performance compared to the continuous gains (CGs). In this paper, we propose a simultaneous detection and estimation (SDE) method where the detector structure is derived by the knowledge of the estimator. The proposed SDE method is a combination of Bayesian and Neyman-Pearson approaches and is expressed as a noncontinuous gain (NCG). It is expected that employing a superior detector, the proposed NCG improves the quality of the output speech. We concentrate on the derivation of the detector so that it minimizes the error caused by missed detection and/or wrong estimation of speech coefficients at a controlled level of falsely detecting high-energy noise as speech. Furthermore, an independent detection and estimation technique is proposed where the detector and the estimator are extracted in an independent manner. Simulation results demonstrate that the proposed SDE method minimizes the speech distortion at a controlled level of noise reduction. It is also shown that the performance of the proposed NCG is better than the CG and than the existing BGs in both noise reduction and speech distortion aspects.
引用
收藏
页码:1249 / 1258
页数:10
相关论文
共 21 条
  • [1] Simultaneous detection and estimation approach for speech enhancement
    Abramson, Ari
    Cohen, Israel
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2348 - 2359
  • [2] Voice activity detection based on multiple statistical models
    Chang, Joon-Hyuk
    Kim, Nam Soo
    Mitra, Sanjit K.
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (06) : 1965 - 1976
  • [3] Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
    Cohen, I
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 466 - 475
  • [4] Noise estimation by minima controlled recursive averaging for robust speech enhancement
    Cohen, I
    Berdugo, B
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) : 12 - 15
  • [5] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR
    EPHRAIM, Y
    MALAH, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06): : 1109 - 1121
  • [6] Garofolo J., 1988, Getting started with the DARPA TIMIT CD-ROM: An acoustic phonetic continuous speech database
  • [7] Speech enhancement employing Laplacian-Gaussian mixture
    Gazor, S
    Zhang, W
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 896 - 904
  • [8] A soft voice activity detector based on a Laplacian-Gaussian model
    Gazor, S
    Zhang, W
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 498 - 505
  • [9] Evaluation of objective quality measures for speech enhancement
    Hu, Yi
    Loizou, Philipos C.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (01): : 229 - 238
  • [10] Spectral Magnitude Minimum Mean-Square Error Estimation Using Binary and Continuous Gain Functions
    Jensen, Jesper
    Hendriks, Richard C.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (01): : 92 - 102