Joint Detection and Estimation of Speech Spectral Amplitude Using Noncontinuous Gain Functions

被引：17

作者：

Momeni, Hajar ^{[1
]}

Abutalebi, Hamid Reza ^{[1
]}

Tadaion, Aliakbar ^{[1
]}

机构：

[1] Yazd Univ, Dept Elect & Comp Engn, Yazd 89195741, Iran

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2015年 / 23卷 / 08期

关键词：

Joint detection and estimation; spectral amplitude estimation; speech detection; speech enhancement;

D O I：

10.1109/TASLP.2015.2427522

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper addresses the joint detection and estimation approach for single-channel speech enhancement. In this approach, a detector decides on speech presence in each time-frequency unit and an estimator estimates the corresponding speech spectral amplitude. We utilize the concept of binary/continuous gain functions to study and extend the process of joint detection and estimation. The binary gains (BGs) have already shown an inferior performance compared to the continuous gains (CGs). In this paper, we propose a simultaneous detection and estimation (SDE) method where the detector structure is derived by the knowledge of the estimator. The proposed SDE method is a combination of Bayesian and Neyman-Pearson approaches and is expressed as a noncontinuous gain (NCG). It is expected that employing a superior detector, the proposed NCG improves the quality of the output speech. We concentrate on the derivation of the detector so that it minimizes the error caused by missed detection and/or wrong estimation of speech coefficients at a controlled level of falsely detecting high-energy noise as speech. Furthermore, an independent detection and estimation technique is proposed where the detector and the estimator are extracted in an independent manner. Simulation results demonstrate that the proposed SDE method minimizes the speech distortion at a controlled level of noise reduction. It is also shown that the performance of the proposed NCG is better than the CG and than the existing BGs in both noise reduction and speech distortion aspects.

引用

页码：1249 / 1258

页数：10

共 21 条

[1] Simultaneous detection and estimation approach for speech enhancement
Abramson, Ari
Cohen, Israel
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2348 - 2359
[2] Voice activity detection based on multiple statistical models
Chang, Joon-Hyuk
Kim, Nam Soo
Mitra, Sanjit K.
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (06) : 1965 - 1976
[3] Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
Cohen, I
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 466 - 475
[4] Noise estimation by minima controlled recursive averaging for robust speech enhancement
Cohen, I
Berdugo, B
[J]. IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) : 12 - 15
[5] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR
EPHRAIM, Y
MALAH, D
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06): : 1109 - 1121
[6] Garofolo J., 1988, Getting started with the DARPA TIMIT CD-ROM: An acoustic phonetic continuous speech database
[7] Speech enhancement employing Laplacian-Gaussian mixture
Gazor, S
Zhang, W
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 896 - 904
[8] A soft voice activity detector based on a Laplacian-Gaussian model
Gazor, S
Zhang, W
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 498 - 505
[9] Evaluation of objective quality measures for speech enhancement
Hu, Yi
Loizou, Philipos C.
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (01): : 229 - 238
[10] Spectral Magnitude Minimum Mean-Square Error Estimation Using Binary and Continuous Gain Functions
Jensen, Jesper
Hendriks, Richard C.
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (01): : 92 - 102

← 1 2 3 →