Speech enhancement based on the discrete Gabor transform and multi-notch adaptive digital filters

被引:10
作者
Erçelebi, E [1 ]
机构
[1] Gaziantep Univ, Dept Elect & Elect Engn, TR-27310 Gaziantep, Turkey
关键词
speech enhancement; multi-notch adaptive digital filter; discrete Gabor transform;
D O I
10.1016/j.apacoust.2004.02.004
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a new method to speech enhancement based on time-frequency analysis and adaptive digital filtering. The proposed method for dual-channel speech enhancement was developed by tracking frequencies of corrupting signal by the discrete Gabor transform (DGT) and implementing multi-notch adaptive digital filter (MNADF) at those frequencies. Since no a priori knowledge of the noise source statistics is required this method differs from traditional speech enhancement methods. Specifically, the proposed method was applied to the case where speech quality and intelligibility deteriorate in the presence of background noise. Speech coders and automatic speech recognition (ASR) systems are designed to act on clean speech signals. Therefore, corrupted speech signals by the noise must be enhanced before their processing. The method uses a primary input containing the corrupted speech signal while a reference input containing the noise only. In this paper, we designed MNADF instead of single-notch adaptive digital filter and used DGT to track frequencies of corrupting signal because fast filtering process and fast measure of the time-dependent noise frequency are of great importance in speech enhancement process. Therefore, MNADF was implemented to take advantage of fast filtering process. Different types of noises from Noisex-92 database were used to degrade real speech signals. Objective measures, the study of the speech spectrograms and global signal-to-noise ratio (SNR), segmental SNR (segSNR), Itakura-Saito distance measure as well as subjective listing test demonstrated consistently superior enhancement performance of the proposed method over traditional speech enhancement method such as spectral subtraction. Combining MNADF and DGT, excellent speech enhancement was obtained. (C) 2004 Elsevier Ltd. All rights reserved.
引用
收藏
页码:739 / 762
页数:24
相关论文
共 39 条
[1]   Systolic design of frequency-domain block LMS adaptive digital filters [J].
Alwan, NAS ;
Al-Hashemy, BAR .
COMPUTERS & ELECTRICAL ENGINEERING, 1998, 24 (3-4) :263-275
[2]  
[Anonymous], P IEEE INT C AC SPEE
[3]   THE DISCRETE ZAK TRANSFORM APPLICATION TO TIME-FREQUENCY ANALYSIS AND SYNTHESIS OF NONSTATIONARY SIGNALS [J].
AUSLANDER, L ;
GERTNER, IC ;
TOLIMIERI, R .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (04) :825-835
[4]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[5]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING 2 MICROPHONE ADAPTIVE NOISE CANCELLATION [J].
BOLL, SF ;
PULSIPHER, DC .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (06) :752-753
[6]   FAST, RECURSIVE-LEAST-SQUARES TRANSVERSAL FILTERS FOR ADAPTIVE FILTERING [J].
CIOFFI, JM ;
KAILATH, T .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (02) :304-337
[7]   Adaptive noise cancellation in a multimicrophone system for distortion product otoacoustic emission acquisition [J].
Delgado, RE ;
Özdamar, Ö ;
Rahman, S ;
Lopez, CN .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2000, 47 (09) :1154-1164
[8]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121
[9]  
Feder M., 1987, Proceedings: ICASSP 87. 1987 International Conference on Acoustics, Speech, and Signal Processing (Cat. No.87CH2396-0), P201
[10]   MAXIMUM-LIKELIHOOD NOISE CANCELLATION USING THE EM ALGORITHM [J].
FEDER, M ;
OPPENHEIM, AV ;
WEINSTEIN, E .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (02) :204-216